Detecting and interpreting real-world and security gestures on touch and hover sensitive devices

ABSTRACT

“Real-world” gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an “OK” hand signal generally indicates an affirmative response) can be interpreted by a touch or hover sensitive device to more efficiently and accurately effect intended operations. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.

CROSS REFERENCES TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 12/753,754, filed Apr. 2, 2010 and published on Jul. 29, 2010 as U.S. Patent Publication No. 2010/0192109, which is a divisional of U.S. patent application Ser. No. 11/818,500, filed Jun. 13, 2007 and issued on Jan. 25, 2011 as U.S. Pat. No. 7,877,707, which claims the benefit under 35 USC 119(e) of U.S. Patent Application No. 60/879,191, filed Jan. 6, 2007, the contents of which are incorporated herein by reference in their entirety for all purposes.

FIELD OF THE INVENTION

This relates to the detection and interpretation of gestures by touch and hover sensitive devices, and more particularly, to the detection and interpretation of real-world gestures and security-related gestures by touch and hover sensitive devices.

BACKGROUND OF THE INVENTION

There exist today many styles of input devices for performing operations in a computer system. The operations generally correspond to moving a cursor and making selections on a display screen. The operations can also include paging, scrolling, panning, zooming, etc. By way of example, the input devices can include buttons, switches, keyboards, mice, trackballs, touch pads, joysticks, touch screens and the like. Each of these devices can have advantages and disadvantages that can be taken into account when designing a computer system.

Buttons and switches can be mechanical in nature and provide limited control with regard to moving the cursor and making selections. For example, they can be dedicated to moving the cursor in a specific direction (e.g., arrow keys) or to making specific selections (e.g., enter, delete, number, etc.).

In using a mouse instrument, the movement of the input pointer on a display can correspond to the relative movements of the mouse as the user moves the mouse along a surface. In using a trackball instrument, the movement of the input pointer on the display can correspond to the relative movements of a trackball as the user moves the ball within a housing. Mouse and trackball instruments can typically also include one or more buttons for making selections. A mouse instrument can also include scroll wheels that can allow a user to scroll the displayed content by rolling the wheel forward or backward.

With a touch pad instrument, such as a touch pad on a personal laptop computer, the movement of the input pointer on a display can correspond to the relative movements of the user's finger (or stylus) as the finger is moved along a surface of the touch pad. Touch screens, on the other hand, are a type of display screen that typically can include a touch-sensitive transparent panel (or “skin”) that can overlay the display screen. When using a touch screen, a user typically makes a selection on the display screen by pointing directly to objects (such as GUI objects) displayed on the screen (usually with a stylus or finger).

To provide additional functionality, hand gestures have been implemented with some of these input devices. By way of example, in touch pads, selections can be made when one or more taps are detected on the surface of the touch pad. In some cases, any portion of the touch pad can be tapped, and in other cases a dedicated portion of the touch pad can be tapped. In addition to selections, scrolling can be initiated by using finger motion at the edge of the touch pad.

U.S. Pat. Nos. 5,612,719 and 5,590,219, assigned to Apple Computer, Inc., describe some other uses of gesturing. U.S. Pat. No. 5,612,719 discloses an onscreen button that can be responsive to at least two different button gestures made on the screen on or near the button. U.S. Pat. No. 5,590,219 discloses a method for recognizing an ellipse-type gesture input on a display screen of a computer system.

In recent times, more advanced gestures have been implemented. For example, scrolling can be initiated by placing four fingers on the touch pad so that the scrolling gesture is recognized, and thereafter moving these fingers on the touch pad to perform scrolling events. The methods for implementing these advanced gestures, however, can be limited and in many instances counterintuitive. In certain applications, it can be beneficial to enable a user to use “real-world” gestures such as hand movements and/or finger orientations that can be generally recognized to mean certain things, such as an “OK” signal, to more efficiently and accurately effect intended operations.

SUMMARY OF THE INVENTION

Embodiments of the present invention are directed to the detection of “real-world” gestures that can be generally recognized to mean certain things by a touch or hover sensitive device to more efficiently and accurately effect intended operations in computing systems. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.

An “OK gesture” can be detected and interpreted to perform operations that can require an affirmative response, such as a text box UI element that asks the user whether a certain action is to be taken. A “grasp everything gesture” can be detected and interpreted to perform operations that can require a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page. A “stamp of approval gesture” can be detected and interpreted to perform operations that can require an affirmative response, such as a UI element including text that asks the user whether a certain action is to be taken. A “circle select gesture” can be detected and interpreted to perform operations that can require a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page. An “X to delete gesture” can be detected and interpreted to perform operations that can require a deleting action, such as the deleting of a UI element such as a file icon or graphics symbol on a desktop or drawing page. A “knock to inquire gesture” can be detected and interpreted to perform operations that can require an inquiry action, such as determining whether an application, file, person of interest or other entity is available to be opened, accessed, communicated with, and the like. A “hitchhiker gesture” can be detected and interpreted to perform operations that can require a directional input, such as scrolling, panning, windowing, translating in time, and the like. Shape gestures can be detected and interpreted to perform various operations, including creating a graphic image of that shape, or selecting or moving UI elements. An “identification gesture” formed by either or both hands on or over a touch or hover sensitive device can be detected and interpreted to perform operations that can require an identification action, such as determining whether the user is to get access, or continue to have access, to a restricted account, file or other item.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of an exemplary computer system according to one embodiment of this invention.

FIG. 2 illustrates another exemplary computer system according to one embodiment of this invention.

FIG. 3 is an exemplary multipoint processing method according to one embodiment of this invention.

FIGS. 4A and 4B illustrate a detected touch image according to one embodiment of this invention.

FIG. 5 illustrates a group of features according to one embodiment of this invention.

FIG. 6 is a parameter calculation method according to one embodiment of this invention.

FIGS. 7A-7E and 7I-7K illustrate various gestures for performing targeting and/or selecting tasks according to one embodiment of this invention.

FIGS. 7F-7H show a diagram of a method for recognizing and implementing gestural inputs of FIGS. 7A-7E.

FIGS. 8A-8G illustrate a rotate gesture according to one embodiment of this invention.

FIG. 9 is a diagram of a touch-based method according to one embodiment of this invention.

FIG. 10 is a diagram of a touch-based method according to one embodiment of this invention.

FIG. 11 is a diagram of a touch-based method according to one embodiment of this invention.

FIG. 12 is a diagram of a zoom gesture method according to one embodiment of this invention.

FIGS. 13A-13H illustrate a zooming sequence according to one embodiment of this invention.

FIG. 14 is a diagram of a pan method according to one embodiment of this invention.

FIGS. 15A-15D illustrate a panning sequence according to one embodiment of this invention.

FIG. 16 is a diagram of a rotate method according to one embodiment of this invention.

FIGS. 17A-17C illustrate a rotating sequence according to one embodiment of this invention.

FIGS. 18A-18C illustrate an exemplary “OK gesture” according to one embodiment of this invention.

FIGS. 19A-19D illustrate an exemplary “grasp everything gesture” according to one embodiment of this invention.

FIGS. 20A-20B illustrate an exemplary “stamp of approval gesture” according to one embodiment of this invention.

FIGS. 21A-21B illustrate an exemplary “circle select gesture” according to one embodiment of this invention.

FIGS. 22A-22D illustrate an exemplary “X to delete gesture” according to one embodiment of this invention.

FIGS. 23A-23B illustrate an exemplary “knock to inquire gesture” according to one embodiment of this invention.

FIGS. 24A-24H illustrate an exemplary “hitchhiker gesture” according to one embodiment of this invention.

FIGS. 25A-25H illustrate exemplary “shape gestures” according to one embodiment of this invention.

FIGS. 26A-26H illustrate exemplary “identification gestures” and “lock/unlock gestures” according to one embodiment of this invention.

FIGS. 27A-27K illustrate exemplary “hand edge gestures” according to one embodiment of this invention.

FIGS. 28A-28B illustrate an exemplary “framing gesture” according to one embodiment of this invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

In the following description of preferred embodiments, reference is made to the accompanying drawings which form a part hereof, and in which it is shown by way of illustration specific embodiments in which the invention can be practiced. It is to be understood that other embodiments can be utilized and structural changes can be made without departing from the scope of the preferred embodiments of the present invention.

“Real-world” gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an “OK” hand signal generally indicates an affirmative response) can be interpreted by a touch or hover sensitive device to more efficiently and accurately effect intended operations. These gestures can include, but are not limited to, “OK gestures,” “grasp everything gestures,” “stamp of approval gestures,” “circle select gestures,” “X to delete gestures,” “knock to inquire gestures,” “hitchhiker directional gestures,” and “shape gestures.” In addition, gestures can be used to provide identification and allow or deny access to applications, files, and the like.

Although some embodiments of the invention can be described herein in terms of capacitive multi-touch sensor panels, embodiments of the invention can be generally applicable to other types of multi-touch sensors that can include resistive touch sensors, surface acoustic wave touch sensors, electromagnetic touch sensors, near field imaging touch sensors, optical touch sensors and the like. In addition, although the proximity sensors can be described herein as infrared (IR) proximity sensors, embodiments of the invention can be generally applicable to other types of proximity sensors having an output that can be AC-coupled to an analog channel.

FIG. 1 is a block diagram of exemplary computer system 50 in accordance with embodiments of this invention. Computer system 50 can correspond to a personal computer system, such as a desktop, laptop, tablet or handheld computer. The computer system can also correspond to a computing device, such as a mobile telephone, PDA, dedicated media player, consumer electronics device, and the like.

The exemplary computer system 50 shown in FIG. 1 can include a processor 56 configured to execute instructions and to carry out operations associated with the computer system 50. For example, using instructions retrieved, for example, from memory, the processor 56 can control the reception and manipulation of input and output data between components of the computing system 50. The processor 56 can be implemented on a single chip, multiple chips or multiple electrical components. For example, various architectures can be used for the processor 56, including dedicated or embedded processors, single purpose processors, controllers, ASICs, and so forth.

In most cases, the processor 56 together with an operating system can operate to execute computer code and produce and use data. Operating systems are generally well known and will not be described in greater detail. By way of example, the operating system can correspond to OS/2, DOS, Unix, Linux, Palm OS, and the like. The operating system can also be a special purpose operating system, such as can be used for limited purpose appliance-type computing devices. The operating system, other computer code and data can reside within a memory block 58 that can be operatively coupled to the processor 56. Memory block 58 can generally provide a place to store computer code and data that can be used by the computer system 50. By way of example, the memory block 58 can include Read-Only Memory (ROM), Random-Access Memory (RAM), a hard disk drive and/or the like. The information could also reside on a removable storage medium and can be loaded or installed onto the computer system 50 when needed. Removable storage media can include, for example, CD-ROM, PC-CARD, memory card, floppy disk, magnetic tape, and a network component.

The computer system 50 can also include a display device 68 that can be operatively coupled to the processor 56. The display device 68 can be a liquid crystal display (LCD) (e.g., active matrix, passive matrix and the like). Alternatively, the display device 68 can be a monitor such as a monochrome display, color graphics adapter (CGA) display, enhanced graphics adapter (EGA) display, variable-graphics-array (VGA) display, super VGA display, cathode ray tube (CRT), and the like. The display device can also correspond to a plasma display or a display implemented with electronic inks.

The display device 68 can be configured to display a graphical user interface (GUI) 69 that can provide an easy to use interface between a user of the computer system and the operating system or application running thereon. Generally speaking, the GUI 69 can represent programs, files and operational options with graphical images, objects, or vector representations. The graphical images can include windows, fields, dialog boxes, menus, icons, buttons, cursors, scroll bars, etc. Such images can be arranged in predefined layouts, or can be created dynamically to serve the specific actions being taken by a user. During operation, the user can select and/or activate various graphical images in order to initiate functions and tasks associated therewith. By way of example, a user can select a button that opens, closes, minimizes, or maximizes a window, or an icon that launches a particular program. The GUI 69 can additionally or alternatively display information, such as non-interactive text and graphics, for the user on the display device 68.

The computer system 50 can also include an input device 70 that can be operatively coupled to the processor 56. The input device 70 can be configured to transfer data from the outside world into the computer system 50. The input device 70 can, for example, be used to perform tracking and to make selections with respect to the GUI 69 on the display 68. The input device 70 can also be used to issue commands in the computer system 50. The input device 70 can include a touch-sensing device configured to receive input from a user's touch and to send this information to the processor 56. By way of example, the touch-sensing device can correspond to a touchpad or a touch screen. In many cases, the touch-sensing device can recognize touches, as well as the position and magnitude of touches on a touch sensitive surface. The touch-sensing device can detect and report the touches to the processor 56, and the processor 56 can interpret the touches in accordance with its programming. For example, the processor 56 can initiate a task in accordance with a particular touch. A dedicated processor can be used to process touches locally and reduce demand for the main processor of the computer system.

The touch-sensing device can be based on sensing technologies including but not limited to capacitive sensing, resistive sensing, surface acoustic wave sensing, pressure sensing, optical sensing, and/or the like. Furthermore, the touch-sensing means can be based on single point sensing or multipoint sensing. Single point sensing can be capable of only distinguishing a single touch, while multipoint sensing can be capable of distinguishing multiple touches that occur at the same time.

As discussed above, the input device 70 can be a touch screen that is positioned over or in front of the display 68, integrated with the display device 68, or can be a separate component, such as a touch pad.

The computer system 50 can also include capabilities for coupling to one or more I/O devices 80. By way of example, the I/O devices 80 can correspond to keyboards, printers, scanners, cameras, microphones, speakers, and/or the like. The I/O devices 80 can be integrated with the computer system 50 or they can be separate components (e.g., peripheral devices). In some cases, the I/O devices 80 can be connected to the computer system 50 through wired connections (e.g., cables/ports). In other cases, the I/O devices 80 can be connected to the computer system 50 through wireless connections. By way of example, the data link can correspond to PS/2, USB, IR, Firewire, RF, Bluetooth or the like.

In accordance with one embodiment of the present invention, the computer system 50 can be designed to recognize gestures 85 applied to the input device 70 and to control aspects of the computer system 50 based on the gestures 85. In some cases, a gesture can be defined as a stylized interaction with an input device that can be mapped to one or more specific computing operations. The gestures 85 can be made through various hand and finger motions. Alternatively or additionally, the gestures can be made with a stylus. In all of these cases, the input device 70 can receive the gestures 85, and the processor 56 can execute instructions to carry out operations associated with the gestures 85. In addition, the memory block 58 can include a gesture operational program 88, which can be part of the operating system or a separate application. The gestural operation program 88 can generally include a set of instructions that can recognize the occurrence of gestures 85 and can inform one or more software agents of the gestures 85 and/or what action(s) to take in response to the gestures 85. Additional details regarding the various gestures that can be used as input commands are discussed further below.

In one embodiment, upon a user performing one or more gestures, the input device 70 can relay gesture information to the processor 56. Using instructions from memory 58, and more particularly, the gestural operational program 88, the processor 56 can interpret the gestures 85 and control different components of the computer system 50, such as memory 58, a display 68 and I/O devices 80, based on the gestures 85. The gestures 85 can be identified as commands for performing actions in applications stored in the memory 58, modifying image objects shown on the display 68, modifying data stored in memory 58, and/or for performing actions in I/O devices 80.
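
By way of illustration only, the following Python sketch shows one way a gesture operational program such as program 88 might be organized: a registry maps recognized gesture names to handler callbacks representing interested software agents. The class name, gesture names and event structure are illustrative assumptions, not the implementation described here.

```python
# Hypothetical sketch of a gesture operational program: a registry that
# maps recognized gesture names to the software agents to be informed.
from typing import Callable, Dict

class GestureProgram:
    def __init__(self) -> None:
        self._handlers: Dict[str, Callable[[dict], None]] = {}

    def register(self, gesture_name: str, handler: Callable[[dict], None]) -> None:
        # A software agent registers interest in a particular gesture.
        self._handlers[gesture_name] = handler

    def dispatch(self, gesture_name: str, event: dict) -> None:
        # Inform the registered agent, if any, that the gesture occurred.
        handler = self._handlers.get(gesture_name)
        if handler is not None:
            handler(event)

program = GestureProgram()
program.register("ok_gesture", lambda e: print("affirmative response at", e))
program.dispatch("ok_gesture", {"x": 120, "y": 80})
```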

Again, although FIG. 1 illustrates the input device 70 and the display 68 as two separate boxes for illustration purposes, the two boxes can be realized on one device.

FIG. 2 illustrates an exemplary computing system 10 that can use a multi-touch panel 24 as an input device for gestures; the multi-touch panel 24 can at the same time be a display panel. The computing system 10 can include one or more multi-touch panel processors 12 dedicated to the multi-touch subsystem 27. Alternatively, the multi-touch panel processor functionality can be implemented by dedicated logic, such as a state machine. Peripherals 11 can include, but are not limited to, random access memory (RAM) or other types of memory or storage, watchdog timers and the like. Multi-touch subsystem 27 can include, but is not limited to, one or more analog channels 17, channel scan logic 18 and driver logic 19. Channel scan logic 18 can access RAM 16, autonomously read data from the analog channels and provide control for the analog channels. This control can include multiplexing columns of multi-touch panel 24 to analog channels 17. In addition, channel scan logic 18 can control the driver logic and stimulation signals being selectively applied to rows of multi-touch panel 24. In some embodiments, multi-touch subsystem 27, multi-touch panel processor 12 and peripherals 11 can be integrated into a single application specific integrated circuit (ASIC).

Driver logic 19 can provide multiple multi-touch subsystem outputs 20 and can present a proprietary interface that drives a high voltage driver, which can include a decoder 21 and a subsequent level shifter and driver stage 22, although level-shifting functions could be performed before decoder functions. Level shifter and driver 22 can provide level shifting from a low voltage level (e.g., CMOS levels) to a higher voltage level, providing a better signal-to-noise (S/N) ratio for noise reduction purposes. Decoder 21 can decode the drive interface signals to one or more of N outputs, where N is the maximum number of rows in the panel. Decoder 21 can be used to reduce the number of drive lines needed between the high voltage driver and multi-touch panel 24. Each multi-touch panel row input 23 can drive one or more rows in multi-touch panel 24. It should be noted that driver 22 and decoder 21 can also be integrated into a single ASIC, be integrated into driver logic 19, or in some instances be unnecessary.

The multi-touch panel 24 can include a capacitive sensing medium having a plurality of row traces or driving lines and a plurality of column traces or sensing lines, although other sensing media can also be used. The row and column traces can be formed from a substantially transparent conductive medium, such as Indium Tin Oxide (ITO) or Antimony Tin Oxide (ATO), although other transparent and non-transparent materials, such as copper, can also be used. In some embodiments, the row and column traces can be formed on opposite sides of a dielectric material, and can be perpendicular to each other, although in other embodiments other non-Cartesian orientations are possible. For example, in a polar coordinate system, the sensing lines can be concentric circles and the driving lines can be radially extending lines (or vice versa). It should be understood, therefore, that the terms “row” and “column,” “first dimension” and “second dimension,” or “first axis” and “second axis” as used herein are intended to encompass not only orthogonal grids, but the intersecting traces of other geometric configurations having first and second dimensions (e.g., the concentric and radial lines of a polar-coordinate arrangement). The rows and columns can be formed on a single side of a substrate, or can be formed on two separate substrates separated by a dielectric material. In some instances, an additional dielectric cover layer can be placed over the row or column traces to strengthen the structure and protect the entire assembly from damage.

At the “intersections” of the traces of the multi-touch panel 24, where the traces pass above and below (cross) each other (but do not make direct electrical contact with each other), the traces can essentially form two electrodes (although more than two traces could intersect as well). Each intersection of row and column traces can represent a capacitive sensing node and can be viewed as a picture element (pixel) 26, which can be particularly useful when multi-touch panel 24 is viewed as capturing an “image” of touch. (In other words, after multi-touch subsystem 27 has determined whether a touch event has been detected at each touch sensor in the multi-touch panel, the pattern of touch sensors in the multi-touch panel at which a touch event occurred can be viewed as an “image” of touch (e.g., a pattern of fingers touching the panel).) The capacitance between row and column electrodes can appear as a stray capacitance on all columns when the given row is held at DC and as a mutual capacitance Csig when the given row is stimulated with an AC signal. The presence of a finger or other object near or on the multi-touch panel can be detected by measuring changes to Csig. The columns of multi-touch panel 24 can drive one or more analog channels 17 (also referred to herein as event detection and demodulation circuits) in multi-touch subsystem 27. In some implementations, each column is coupled to one dedicated analog channel 17. However, in other implementations, the columns can be couplable via an analog switch to a fewer number of analog channels 17.
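
To make the notion of an “image” of touch concrete, here is a minimal Python sketch that scans each row, compares the measured mutual capacitance Csig at every row/column intersection to a no-touch baseline, and marks pixels where the deviation exceeds a threshold. The measure_csig function stands in for the analog stimulation and demodulation hardware and is an assumption for illustration.

```python
# Illustrative only: build an "image" of touch by comparing measured mutual
# capacitance Csig at each row/column intersection to a no-touch baseline.
import random

ROWS, COLS = 4, 4
BASELINE = 1.0  # nominal Csig with no touch (arbitrary units)

def measure_csig(row: int, col: int) -> float:
    # Stand-in for stimulating `row` with an AC signal and demodulating the
    # column; a nearby finger reduces the measured Csig.
    touched = (row, col) in {(1, 2), (2, 2)}
    return BASELINE - (0.3 if touched else 0.0) + random.gauss(0, 0.01)

def scan_touch_image(threshold: float = 0.1) -> list:
    image = []
    for r in range(ROWS):
        # Rows are stimulated one at a time; all columns are read per row.
        image.append([1 if BASELINE - measure_csig(r, c) > threshold else 0
                      for c in range(COLS)])
    return image

for row in scan_touch_image():
    print(row)
```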

Computing system 10 can also include host processor 14 for receiving outputs from multi-touch panel processor 12 and performing actions based on the outputs that can include, but are not limited to, moving an object such as a cursor or pointer, scrolling or panning, adjusting control settings, opening a file or document, viewing a menu, making a selection, executing instructions, operating a peripheral device connected to the host device, etc. Host processor 14, which can be a personal computer CPU, can also perform additional functions that may not be related to multi-touch panel processing, and can be coupled to program storage 15 and display device 13, such as an LCD display, for providing a user interface (UI) to a user of the device. It should also be noted that multi-touch panel 24 and display device 13 can be integrated into one single touch-screen display device.

Although capacitive touch sensor panels can detect objects hovering within the near field of the panel, where they can appear as “weak” touches, in some embodiments proximity sensor panels can be co-located with touch sensor panels to provide hover detection capabilities outside the near-field hover detection capabilities of capacitive touch sensor panels. These proximity sensor panels can be arranged as an array of proximity sensors that can be scanned in a manner similar to the scanning of a touch sensor panel to generate an “image” of hover. Further details of multi-touch sensor detection, including proximity detection by a touch panel and proximity detection by a proximity sensor panel, are described in commonly assigned co-pending applications, including U.S. patent application Ser. No. 10/840,862, published on May 11, 2006 as U.S. Publication No. US2006/0097991, and U.S. Patent Application entitled “Proximity and Multi-Touch Sensor Detection and Demodulation,” filed on Jan. 3, 2007, the entirety of both of which are incorporated herein by reference.

FIG. 3 illustrates a multipoint processing method 100, in accordance with one embodiment of the present invention. The multipoint processing method 100 can, for example, be performed in the system shown in FIG. 1 or 2. The multipoint processing method 100 generally begins at block 102 where images can be read from a multipoint input device, and more particularly a multipoint touch screen. Although the term “image” is used, it should be noted that the data can come in other forms. In most cases, the image read from the touch screen provides magnitude (Z) as a function of position (x and y) for each sensing point or pixel of the touch screen. The magnitude can, for example, reflect the capacitance measured at each point.

Following block 102, multipoint processing method 100 can proceed to block 104 where the image can be converted into a collection or list of features. Each feature can represent a distinct input such as a touch. In most cases, each feature can include its own unique identifier (ID), x coordinate, y coordinate, Z magnitude, angle Θ, area A, and the like. By way of example, FIGS. 4A and 4B illustrate a particular image 120 in time. In image 120, there are two features 122 based on two distinct touches. The touches can, for example, be formed from a pair of fingers touching the touch screen. As shown, each feature 122 can include a unique identifier (ID), x coordinate, y coordinate, Z magnitude, angle Θ, and area A. More particularly, the first feature 122A is represented by ID₁, X₁, Y₁, Z₁, Θ₁, A₁ and the second feature 122B is represented by ID₂, X₂, Y₂, Z₂, Θ₂, A₂. This data can be outputted, for example, using a multi-touch protocol.

The conversion from data or images to features can be accomplished using methods described in U.S. Patent Publication No. US2006/0097991. As disclosed therein, the raw data can typically be received in a digitized form, and can include values for each node of the touch screen. The values can be between 0 and 256, where 0 equates to no touch pressure and 256 equates to full touch pressure. Thereafter, the raw data can be filtered to reduce noise. Once filtered, gradient data, which indicates the topology of each group of connected points, can be generated. Thereafter, the boundaries for touch regions can be calculated based on the gradient data (i.e., a determination is made as to which points are grouped together to form each touch region). By way of example, a watershed algorithm can be used. Once the boundaries are determined, the data for each of the touch regions can be calculated (e.g., X, Y, Z, Θ, A).
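
The following simplified Python sketch illustrates the general pipeline described above: thresholding the raw data, grouping connected points into touch regions (a simple flood fill is used here in place of a full watershed algorithm), and computing per-region parameters such as the X/Y centroid, Z magnitude and area A. The data layout and threshold value are assumptions.

```python
# A simplified stand-in for the feature-extraction pipeline: threshold the
# raw touch image, group connected points into touch regions, and compute
# X, Y, Z and area A for each region.
def extract_features(raw, threshold=32):
    rows, cols = len(raw), len(raw[0])
    seen = set()
    features = []
    for r in range(rows):
        for c in range(cols):
            if raw[r][c] > threshold and (r, c) not in seen:
                stack, region = [(r, c)], []
                seen.add((r, c))
                while stack:                      # flood fill one region
                    y, x = stack.pop()
                    region.append((y, x))
                    for ny, nx in ((y-1, x), (y+1, x), (y, x-1), (y, x+1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and raw[ny][nx] > threshold
                                and (ny, nx) not in seen):
                            seen.add((ny, nx))
                            stack.append((ny, nx))
                z = sum(raw[y][x] for y, x in region)
                # Z-weighted centroid of the region.
                cx = sum(x * raw[y][x] for y, x in region) / z
                cy = sum(y * raw[y][x] for y, x in region) / z
                features.append({"id": len(features), "x": cx, "y": cy,
                                 "z": z, "area": len(region)})
    return features

raw = [[0, 0, 0, 0],
       [0, 90, 120, 0],
       [0, 0, 0, 0],
       [0, 0, 0, 64]]
print(extract_features(raw))  # two regions -> two features
```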

Following block 104, multipoint processing method 100 proceeds to block 106 where feature classification and groupings can be performed. During classification, the identity of each of the features can be determined. For example, the features can be classified as a particular finger, thumb, palm or other object. Once classified, the features can be grouped. The manner in which the groups are formed can be widely varied. In most cases, the features can be grouped based on some criteria (e.g., they carry a similar attribute). For example, the two features shown in FIGS. 4A and 4B can be grouped together because each of these features can be located in proximity to each other or because they can be from the same hand. The grouping can include some level of filtering to filter out features that are not part of the touch event. In filtering, one or more features can be rejected because they either meet some predefined criteria or because they do not meet some criteria. By way of example, one of the features can be classified as a thumb located at the edge of a tablet PC. Because the thumb can be used to hold the device rather than being used to perform a task, the feature generated therefrom can be rejected, i.e., it is not considered part of the touch event being processed.

Following block 106, the multipoint processing method 100 proceeds to block 108 where key parameters for the feature groups can be calculated. The key parameters can include distance between features, x/y centroid of all features, feature rotation, total pressure of the group (e.g., pressure at centroid), and the like. As shown in FIG. 5, the calculation can include finding the centroid C, drawing a virtual line 130 to each feature from the centroid C, defining the distance D for each virtual line (D₁ and D₂), and then averaging the distances D₁ and D₂. Once the parameters are calculated, the parameter values can be reported. The parameter values can typically be reported with a group identifier (GID) and number of features within each group (in this case three). In most cases, both initial and current parameter values can be reported. The initial parameter values can be based on set down or touch down, i.e., when the user sets their fingers on the touch screen, and the current values can be based on any point within a stroke occurring after set down.
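
As a worked illustration of this parameter calculation, the Python sketch below computes the x/y centroid of a feature group, the average feature-to-centroid distance, and the total pressure; the dictionary-based feature representation is an assumption carried over from the earlier sketch.

```python
# Illustrative computation of key group parameters: the x/y centroid of all
# features and the average feature-to-centroid distance.
from math import hypot

def group_parameters(features):
    n = len(features)
    cx = sum(f["x"] for f in features) / n
    cy = sum(f["y"] for f in features) / n
    # Distance D of each virtual line from the centroid C to a feature.
    distances = [hypot(f["x"] - cx, f["y"] - cy) for f in features]
    return {"centroid": (cx, cy),
            "avg_distance": sum(distances) / n,
            "total_pressure": sum(f["z"] for f in features)}

initial = group_parameters([{"x": 0, "y": 0, "z": 80}, {"x": 4, "y": 0, "z": 90}])
current = group_parameters([{"x": -1, "y": 0, "z": 80}, {"x": 5, "y": 0, "z": 90}])
print(initial["avg_distance"], current["avg_distance"])  # 2.0 -> 3.0 (spread)
```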

As should be appreciated, blocks 102-108 are repetitively performed during a user stroke, thereby generating a plurality of sequentially configured signals. The initial and current parameters can be compared in later steps to perform actions in the system.

Following block 108, the process flow proceeds to block 110 where the group is or can be associated with a user interface (UI) element. UI elements can be buttons, boxes, lists, sliders, wheels, knobs, pictures, documents, icons, etc. Each UI element can represent a component or control of the user interface. The application behind the UI element(s) can have access to the parameter data calculated in block 108. In one implementation, the application can rank the relevance of the touch data to the UI element corresponding thereto. The ranking can be based on some predetermined criteria. The ranking can include producing a figure of merit and giving whichever UI element has the highest figure of merit sole access to the group. There can even be some degree of hysteresis as well (once one of the UI elements claims control of that group, the group sticks with the UI element until another UI element has a much higher ranking). By way of example, the ranking can include determining proximity of the centroid (or features) to the image object associated with the UI element.
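
A hedged sketch of this ranking idea follows: each UI element receives a figure of merit based on proximity to the group centroid, and a hysteresis multiplier keeps the group with its current owner until a competitor scores substantially higher. The element records, merit function and hysteresis factor are all illustrative assumptions.

```python
# Hypothetical ranking of UI elements by a proximity-based figure of merit,
# with hysteresis so an element that already owns the group keeps it unless
# a competitor scores much higher.
from math import hypot

def rank_ui_elements(centroid, elements, owner=None, hysteresis=2.0):
    def merit(elem):
        # Closer element centers score higher (inverse distance).
        score = 1.0 / (1e-6 + hypot(centroid[0] - elem["x"],
                                    centroid[1] - elem["y"]))
        if owner is not None and elem["name"] == owner:
            score *= hysteresis  # sticky ownership
        return score
    return max(elements, key=merit)["name"]

elements = [{"name": "knob", "x": 10, "y": 10},
            {"name": "slider", "x": 30, "y": 10}]
print(rank_ui_elements((22, 10), elements))                # slider is closer
print(rank_ui_elements((22, 10), elements, owner="knob"))  # knob keeps control
```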

Following block 110, the multipoint processing method 100 proceeds to blocks 112 and 114. Blocks 112 and 114 can be performed approximately at the same time. From the user's perspective, in one embodiment, blocks 112 and 114 appear to be performed concurrently. In block 112, one or more actions can be performed based on differences between initial and current parameter values, and can also be based on a UI element to which they are associated, if any. In block 114, user feedback pertaining to the one or more actions being performed can be provided. By way of example, user feedback can include display, audio, tactile feedback and/or the like.

FIG. 6 is a parameter calculation method 150, in accordance with one embodiment of the present invention. The parameter calculation method 150 can, for example, correspond to block 108 shown in FIG. 3. The parameter calculation method 150 generally begins at block 152 where a group of features can be received. Following block 152, the parameter calculation method 150 proceeds to block 154 where a determination can be made as to whether or not the number of features in the group of features has changed. For example, the number of features can have changed due to the user picking up or placing an additional finger. Different fingers can be needed to perform different controls (e.g., tracking, gesturing). If the number of features has changed, the parameter calculation method 150 can proceed to block 156 where the initial parameter values can be calculated. If the number stays the same, the parameter calculation method 150 can proceed to block 158 where the current parameter values can be calculated. Thereafter, the parameter calculation method 150 proceeds to block 160 where the initial and current parameter values can be reported. By way of example, the initial parameter values can contain the average initial distance between points (or Distance (AVG) initial) and the current parameter values can contain the average current distance between points (or Distance (AVG) current). These can be compared in subsequent steps in order to control various aspects of a computer system.
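
The following Python sketch mirrors the flow of FIG. 6 under assumed data structures: when the number of features changes, new initial parameter values are captured (block 156); otherwise current values are calculated (block 158), and both are reported (block 160).

```python
# Sketch of the parameter calculation flow of FIG. 6: recompute the initial
# values whenever the number of features changes (a finger was added or
# lifted), otherwise report current values alongside the stored initial ones.
from math import hypot

def avg_distance(features):
    n = len(features)
    cx = sum(f["x"] for f in features) / n
    cy = sum(f["y"] for f in features) / n
    return sum(hypot(f["x"] - cx, f["y"] - cy) for f in features) / n

class ParameterCalculator:
    def __init__(self):
        self.initial = None
        self.num_features = 0

    def update(self, features):
        current = avg_distance(features)
        if len(features) != self.num_features:
            self.num_features = len(features)
            self.initial = current          # block 156: new initial values
        return self.initial, current        # block 160: report both

calc = ParameterCalculator()
print(calc.update([{"x": 0, "y": 0}, {"x": 4, "y": 0}]))   # (2.0, 2.0)
print(calc.update([{"x": -1, "y": 0}, {"x": 5, "y": 0}]))  # (2.0, 3.0)
```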

The above methods and techniques can be used to implement any number of GUI interface objects and actions. For example, gestures can be created to detect and effect a user command to resize a window, scroll a display, rotate an object, zoom in or out of a displayed view, delete or insert text or other objects, etc.

A basic category of gestures can allow a user to input the common commands that can be inputted through the use of a conventional mouse or trackball instrument. FIG. 7F shows a flow chart for processing the detection of mouse-click actions. Starting with block 710, detection can be made of either one or two touches by fingers. If the touch detected is determined 711 to be one finger, then a determination 712 can be made as to whether the touch is in a predetermined proximity of a displayed image object that is associated with a selectable file object, and if so, then a selection action can be made 714. If a double tap action is detected 716 in association with a selectable object, then a double-click action can be invoked 718. A double tap action can be determined by the detection of a finger leaving the touch screen and immediately retouching the touch screen twice. In accordance with an alternative embodiment, a double-click action can also be invoked if it is detected that a finger touch on a selected object remains for more than a predetermined period of time.

As shown in FIG. 7G, if the one finger touch detected is not associated with a selectable file object, but rather is determined 720 to be associated with a network address hyperlink, then a single-click action can be invoked whereby the hyperlink is activated. If the hyperlink was touched within a non-browser environment, then a browser application would also be launched.

If a two finger touch is detected 711, then if at least one of the touchdown points is associated with a selectable file object 713, then a selection 715 can be made of the object. If one or more taps by one of the fingers on the touch sensitive display are detected 717 while the touchdown point is maintained, then a right-click mouse action can be invoked.
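
A minimal sketch of this decision flow, with assumed event fields and an assumed double-tap time window, might look as follows; it is illustrative only and not the flow chart's literal implementation.

```python
# Hedged sketch of the decision flow in FIGS. 7F-7H: interpret one- and
# two-finger touch events as select, double-click, hyperlink or right-click
# actions. Event fields and thresholds are illustrative assumptions.
DOUBLE_TAP_WINDOW = 0.3  # seconds; assumed threshold

def interpret_touch(event, last_tap_time=None):
    if event["fingers"] == 1:
        if event.get("on_selectable"):
            if (last_tap_time is not None
                    and event["time"] - last_tap_time < DOUBLE_TAP_WINDOW):
                return "double_click"       # open/launch the object
            return "select"                 # single-click selection
        if event.get("on_hyperlink"):
            return "activate_hyperlink"
    elif event["fingers"] == 2 and event.get("second_finger_tapped"):
        return "right_click"                # tap beside a held touchdown
    return "none"

print(interpret_touch({"fingers": 1, "on_selectable": True, "time": 1.0}))
print(interpret_touch({"fingers": 1, "on_selectable": True, "time": 1.2},
                      last_tap_time=1.0))
print(interpret_touch({"fingers": 2, "second_finger_tapped": True, "time": 2.0}))
```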

In accordance with a preferred embodiment, if a touch or touches detected are not associated with any selectable file object or hyperlink, then as shown in FIG. 7H, a determination 722 can be made as to whether the touchdown point(s) is/are associated with a scrollable area, such as a text editing application window, a file listing window, or an Internet webpage.

Scrolling generally pertains to moving displayed data or images across a viewing area on a display screen so that a new set of data is brought into view in the viewing area. In most cases, once the viewing area is full, each new set of data appears at the edge of the viewing area and all other sets of data move over one position. That is, the new set of data appears for each set of data that moves out of the viewing area. In essence, these functions allow a user to view consecutive sets of data currently outside of the viewing area. In most cases, the user is able to accelerate their traversal through the data sets by moving his or her finger at greater speeds. Examples of scrolling through lists can be found in U.S. Patent Publication Nos. 2003/0076303A1, 2003/0076301A1, and 2003/0095096A1, which are herein incorporated by reference.

If the touchdown point(s) is/are within a scrollable area, then a scrolling action can be invoked 723, similar to the pressing down of a scroll wheel on a conventional mouse instrument. If the scrollable area is scrollable in only one direction (e.g., up and down), then the scrolling action invoked will be a unidirectional scroll. If the scrollable area is scrollable two-dimensionally, then the scrolling action invoked will be omnidirectional.

In a unidirectional scrolling action where the scrolling is restricted to the vertical direction (i.e., the Y axis), only the vertical vector component of the tracked touch movement can be used as input for effecting vertical scrolling. Similarly, in a unidirectional scrolling action where the scrolling is restricted to the horizontal direction (i.e., the X axis), only the horizontal vector component of the tracked touch movement can be used as input for effecting horizontal scrolling. If the scrolling action is omnidirectional, then the scrolling action effected can track the movement of the tracked touch.

If the detected touch is a one finger touch, then the scrolling action is ready to be performed 724 at a normal, or 1×, speed. If and once the touched down finger begins to move on the touch screen, then a scroll action can be performed by tracking the movement of the touchdown point on the touch screen. If the detected touch is a two finger touch, then the scrolling action can be performed 725 at a double, or 2×, speed. Additional fingers can be added to perform an even faster scrolling action, where a detection of a four finger touch can be translated into “pg up” or “pg dn” commands within a multi-page document window.
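
By way of a hedged example, the mapping from finger count to scrolling behavior described above could be sketched as follows; the three-finger speed and the return types are assumptions.

```python
# Illustrative mapping of touchdown-finger count to scrolling behavior:
# one finger scrolls at 1x, two at 2x, and four fingers page up/down.
def scroll_action(finger_count, dy):
    if finger_count == 4:
        return "pg_up" if dy < 0 else "pg_dn"
    speed = {1: 1.0, 2: 2.0, 3: 3.0}.get(finger_count, 1.0)
    return {"scroll_by": dy * speed}

print(scroll_action(1, 10))   # {'scroll_by': 10.0}
print(scroll_action(2, 10))   # {'scroll_by': 20.0}
print(scroll_action(4, -5))   # 'pg_up'
```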

In accordance with another embodiment, the displayed data can continue to move even when the finger is removed from the touch screen. The continuous motion can be based at least in part on the previous motion. For example, the scrolling can be continued in the same direction and at the same speed. In some cases, the scrolling can slow down over time, i.e., the speed of the traversal through the media items can get slower and slower until the scrolling eventually stops, thereby leaving a static list. By way of example, each new media item brought into the viewing area can incrementally decrease the speed. Alternatively or additionally, the displayed data can stop moving when the finger is placed back on the touch screen. That is, the placement of the finger back on the touch screen can implement braking, which can stop or slow down the continuing motion.
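
A minimal sketch of such continued motion, assuming a simple per-frame exponential decay of the last tracked velocity and immediate braking when a finger touches back down (the decay factor and stop threshold are assumptions):

```python
# Illustrative continued scrolling after finger lift-off: the last tracked
# velocity decays each frame until the motion stops, and placing a finger
# back down brakes immediately.
def inertial_scroll(initial_velocity, decay=0.9, min_speed=0.5,
                    braked_at_frame=None):
    positions, velocity, position, frame = [], initial_velocity, 0.0, 0
    while abs(velocity) > min_speed:
        if braked_at_frame is not None and frame >= braked_at_frame:
            break                       # finger placed back down: stop
        position += velocity
        velocity *= decay               # traversal gets slower and slower
        positions.append(round(position, 1))
        frame += 1
    return positions

print(inertial_scroll(10.0))                      # decaying glide to a stop
print(inertial_scroll(10.0, braked_at_frame=3))   # braked early by a touch
```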

By way of examples to illustrate the above discussed gestural actions, as shown in FIG. 7A, using a touch screen (such as the multi-touch screen 24 shown in FIG. 2), a single tap with a finger 501 on an image object (e.g., a file listing 500) can be translated into the equivalent of a single click of a mouse, which in this instance can indicate a selection, which is typically indicated by highlighting of the selected file or image object. A detected double tap on the image object can be translated into the equivalent of a double click of a mouse, which can invoke a launch of an application associated with the image object tapped. For instance, a double tapping of a file listing on a screen, such as a photo file, can cause the launch of a photo viewer application and the opening of that photo file.

A drag-and-drop function can be invoked by touching, with at least one finger, the image associated with the object to be dropped and graphically dragging the object to the desired drop location by maintaining the touch, as shown in FIG. 7B, which illustrates a drag and drop of a file listing 501 from folder window 502 to folder window 503.

Certain mouse functionalities can require two touches to complete. For instance, as shown in FIG. 7C, a “right click” gesture can be invoked by two fingers, with one finger as the touchdown finger 506 and a second finger 507 tapping the screen at least once to indicate a right-click action. FIG. 7D illustrates that, after a right-click action is performed, an action window 504 can be invoked, after which the first finger can move over to the invoked window 504 to select and tap an action item 505 with a single finger 506. In accordance with one embodiment of the invention, a right-click action can be effected only if the tapping detected is located in close proximity to the detected touchdown, and only if the tapping detected is located to the left of the touchdown finger (right of the touchdown finger from the user's point of view).

Other file selection functions that can normally require a combination of mouse and keyboard actions can be performed using only touch actions. For instance, in the Microsoft Windows environment, in order to select multiple files within file window 502, a user typically needs to hold down the shift button while dragging the mouse icon over the sequential files to be selected. Without holding down the shift button, the dragging of the mouse icon can be interpreted as a drag and drop action. As shown in FIG. 7E, in accordance with an embodiment of the present invention, a detection of two closely associated touches dragging across file listings can be read as a multi-selection action for selecting a group of files 508. In order to avoid misinterpreting the two-touch action as another command, such as a rotating action, the two-touch multi-selection function can be invoked only if the two touches detected are in relatively close proximity to each other.

Referring to the scrolling actions described in FIG. 7H, and as shown in FIGS. 7I and 7J, a one or two finger touchdown within a scrollable window can cause the displayed content of the window to scroll at different speeds. Specifically, once a scrolling action is invoked 723, the scrolling takes place at 1× speed 724 if it is determined that only one finger (or one touchdown point) is detected on the touch sensitive display, and at 2× speed if two fingers (or two touchdown points) are detected. In accordance with a preferred embodiment, during the scroll action, scroll bars 727 and 728 can move in correspondence to the direction of the scrolls.

Finally, using a multi-touch display that is capable of proximity detection, such as the panels described in the aforementioned and incorporated-by-reference commonly assigned co-pending U.S. application Ser. No. 10/840,862 (published as U.S. Patent Publication No. US2006/0097991) and an application entitled “Proximity and Multi-Touch Sensor Detection and Demodulation,” Attorney Docket No. 10684-20011.00, filed on Jan. 3, 2007, gestures of a finger can also be used to invoke a hovering action that can be the equivalent of hovering a mouse icon over an image object.

By way of an example, referring to FIG. 7K, the detection of proximity of a user's finger 501 over application icons 731 within a desktop 729 can be interpreted as a hovering action, which can invoke the rolling popup of the hovered application icon 730. If the user touches the popped-up icon, then a double-click action can be invoked whereby the application can be launched. Similar concepts can be applied to application-specific situations, such as when photo files are displayed in a thumbnail format within photo management software, and a detection of proximity of a finger over a thumbnail invokes a hover action whereby the size of the hovered photo thumbnail can be enlarged (but the thumbnail is not selected).

Gestures can also be used to invoke and manipulate virtual control interfaces, such as volume knobs, switches, sliders, keyboards, and other virtual interfaces that can be created to facilitate human interaction with a computing system or a consumer electronic item. By way of an example using a gesture to invoke a virtual control interface, and referring to FIGS. 8A-8G, a rotate gesture for controlling a virtual volume knob 170 on a GUI interface 172 of a display 174 of a tablet PC 175 will be described. In order to actuate the knob 170, the user places their fingers 176 on a multipoint touch screen 178. The virtual control knob can already be displayed, or the particular number, orientation or profile of the fingers at set down, or the movement of the fingers immediately thereafter, or some combination of these and other characteristics of the user's interaction can invoke the virtual control knob to be displayed. In either case, the computing system can associate a finger group with the virtual control knob and can make a determination that the user intends to use the virtual volume knob.

This association can also be based in part on the mode or current state of the computing device at the time of the input. For example, the same gesture can be interpreted alternatively as a volume knob gesture if a song is currently playing on the computing device, or as a rotate command if an object editing application is being executed. Other user feedback can be provided, including, for example, audible or tactile feedback.

Once the knob 170 is displayed as shown in FIG. 8A, the user's fingers 176 can be positioned around the knob 170 as if it were an actual knob or dial, and thereafter can be rotated around the knob 170 in order to simulate turning the knob 170. Again, audible feedback in the form of a clicking sound or tactile feedback in the form of vibration, for example, can be provided as the knob 170 is “rotated.” The user can also use their other hand to hold the tablet PC 175.

As shown in FIG. 8B, the multipoint touch screen 178 can detect at least a pair of images. In particular, a first image 180 can be created at set down, and at least one other image 182 can be created when the fingers 176 are rotated. Although only two images are shown, in most cases there can be many more images that incrementally occur between these two images. Each image can represent a profile of the fingers in contact with the touch screen at a particular instant in time. These images can also be referred to as touch images. It will be understood that the term “image” does not mean that the profile is displayed on the screen 178 (but rather imaged by the touch sensing device). It should also be noted that although the term “image” is used, the data can be in other forms representative of the touch plane at various times.

As shown in FIG. 8C, each of the images 180 and 182 can be converted to a collection of features 184. Each feature 184 can be associated with a particular touch, as for example from the tips of each of the fingers 176 surrounding the knob 170 as well as the thumb of the other hand 177 used to hold the tablet PC 175.

As shown in FIG. 8D, the features 184 can be classified, i.e., each finger/thumb can be identified, and grouped for each of the images 180 and 182. In this particular case, the features 184A associated with the knob 170 can be grouped together to form group 188 and the feature 184B associated with the thumb is filtered out. In alternative arrangements, the thumb feature 184B can be treated as a separate feature by itself (or in another group), for example, to alter the input or operational mode of the system or to implement another gesture, for example, a slider gesture associated with an equalizer slider displayed on the screen in the area of the thumb (or other finger).

As shown in FIG. 8E, the key parameters of the feature group 188 can be calculated for each image 180 and 182. The key parameters associated with the first image 180 represent the initial state and the key parameters of the second image 182 represent the current state.

Also as shown in FIG. 8E, the knob 170 can be the UI element associated with the feature group 188 because of its proximity to the knob 170. Thereafter, as shown in FIG. 8F, the key parameter values of the feature group 188 from each image 180 and 182 can be compared to determine the rotation vector, i.e., the group of features rotated five (5) degrees clockwise from the initial to the current state. In FIG. 8F, the initial feature group (image 180) is shown in dashed lines while the current feature group (image 182) is shown in solid lines.

As shown in FIG. 8G, based on the rotation vector, the speaker 192 of the tablet PC 175 increases (or decreases) its output in accordance with the amount of rotation of the fingers 176, e.g., increase the volume by 5% based on a rotation of 5 degrees. The display 174 of the tablet PC can also adjust the rotation of the knob 170 in accordance with the amount of rotation of the fingers 176, e.g., the position of the knob 170 rotates five (5) degrees. In most cases, the rotation of the knob can occur simultaneously with the rotation of the fingers, i.e., for every degree of finger rotation the knob can rotate a degree. In essence, the virtual control knob can follow the gesture occurring on the screen. Still further, an audio unit 194 of the tablet PC can provide a clicking sound for each unit of rotation, e.g., provide five clicks based on a rotation of five degrees. Still yet further, a haptics unit 196 of the tablet PC 175 can provide a certain amount of vibration or other tactile feedback for each click, thereby simulating an actual knob.
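
For illustration, the mapping from the computed rotation vector to volume, knob angle and click feedback could be sketched as below; the one-percent-per-degree and one-click-per-degree mappings follow the 5-degree/5% example above, while the clamping behavior is an assumption.

```python
# Illustrative mapping from the rotation vector of the feature group to the
# virtual knob: each degree of finger rotation turns the knob one degree and
# adjusts volume proportionally (5 degrees -> 5%), with one click per degree.
def apply_knob_rotation(volume_percent, knob_angle, rotation_degrees):
    volume = max(0.0, min(100.0, volume_percent + rotation_degrees))
    angle = (knob_angle + rotation_degrees) % 360
    clicks = abs(round(rotation_degrees))  # one audible/haptic click per degree
    return volume, angle, clicks

print(apply_knob_rotation(50.0, 0.0, 5.0))  # (55.0, 5.0, 5)
```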

It should be noted that additional gestures can be performed simultaneously with the virtual control knob gesture. For example, more than one virtual control knob can be controlled at the same time using both hands, i.e., one hand for each virtual control knob. Alternatively or additionally, one or more slider bars can be controlled at the same time as the virtual control knob, i.e., one hand operates the virtual control knob, while at least one finger and maybe more than one finger of the opposite hand can operate at least one slider and maybe more than one slider bar, e.g., one slider bar for each finger.

It should also be noted that although the described embodiment utilizes a virtual control knob, in another embodiment, the UI element can be a virtual scroll wheel. As an example, the virtual scroll wheel can mimic an actual scroll wheel such as those described in U.S. Patent Publication Nos. US2003/0076303A1, US2003/0076301A1, and US2003/0095096A1, all of which are herein incorporated by reference.

FIG. 9 is a diagram of a touch-based method 200 in accordance with one embodiment of the present invention. The method generally begins at block 202 where a user input that occurs over a multipoint sensing device is detected. The user input can include one or more touch inputs, with each touch input having a unique identifier. Following block 202, the touch-based method 200 proceeds to block 204 where the user input can be classified as a tracking or selection input when the user input includes a single unique identifier (one touch input), or can be classified as a gesture input when the user input includes at least two unique identifiers (more than one touch input). If the user input is classified as a tracking input, the touch-based method 200 proceeds to block 206 where tracking can be performed corresponding to the user input.

If the user input is classified as a gesture input, the touch-based method 200 proceeds to block 208 where one or more gesture control actions can be performed corresponding to the user input. The gesture control actions can be based at least in part on changes that occur with or between the at least two unique identifiers.
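
The classification step of block 204 reduces to counting unique touch identifiers, as in this brief sketch (the list-of-identifiers input format is an assumption):

```python
# Sketch of block 204 in FIG. 9: classify input by the number of unique
# touch identifiers -- one identifier means tracking/selection, two or more
# mean a gesture.
def classify_input(touch_ids):
    return "tracking_or_selection" if len(set(touch_ids)) == 1 else "gesture"

print(classify_input([7]))       # tracking_or_selection
print(classify_input([7, 12]))   # gesture
```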

FIG. 10 is a diagram of a touch-based method 250 in accordance with one embodiment of the present invention. The touch-based method 250 generally begins at block 252 where an initial image can be captured during an input stroke on a touch sensitive surface. Following block 252, the touch-based method 250 proceeds to block 254 where the touch mode can be determined based on the initial image. For example, if the initial image includes a single unique identifier then the touch mode can correspond to a tracking or selection mode. On the other hand, if the image includes more than one unique identifier, then the touch mode can correspond to a gesture mode.

Following block 254, the touch-based method 250 proceeds to block 256 where a next image can be captured during the input stroke on the touch sensitive surface. Images can be captured sequentially during the stroke, and thus there can be a plurality of images associated with the stroke.

Following block 256, touch-based method 250 proceeds to block 258 where a determination can be made as to whether the touch mode changed between capture of the initial image and capture of the next image. If the touch mode changed, the touch-based method 250 proceeds to block 260 where the next image can be set as the initial image, and thereafter the touch mode can again be determined at block 254 based on the new initial image. If the touch mode stayed the same, the touch-based method 250 proceeds to block 262 where the initial and next images can be compared and one or more control signals can be generated based on the comparison.
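
A hedged Python sketch of this loop follows; the representation of an "image" as a list of touch points, and mode determination by point count, are simplifying assumptions.

```python
# Sketch of the FIG. 10 loop: determine the touch mode from an initial image,
# then compare successive images; a mode change resets the initial image,
# otherwise the pair is compared to generate control signals.
def touch_mode(image):
    return "tracking" if len(image) == 1 else "gesture"

def process_stroke(images):
    signals = []
    initial = images[0]
    for nxt in images[1:]:
        if touch_mode(nxt) != touch_mode(initial):
            initial = nxt                       # block 260: reset initial
        else:
            signals.append((initial, nxt))      # block 262: compare images
    return signals

# Each "image" here is just a list of touch points, for illustration.
stroke = [[(0, 0)], [(0, 1)], [(0, 2), (5, 2)], [(0, 3), (5, 1)]]
print(len(process_stroke(stroke)))  # 2 comparisons; one mode change skipped
```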

FIG. 11 is a diagram of a touch-based method 300 in accordance with one embodiment of the present invention. The touch-based method 300 begins at block 302 where an image object, which can be a GUI object, can be output. For example, a processor can instruct a display to display a particular image object. Following block 302, the touch-based method 300 proceeds to block 304 where a gesture input can be received over the image object. For instance, a user can set or move their fingers in a gestural way on the surface of the touch screen while over the displayed image object. The gestural input can include one or more single gestures that occur consecutively or multiple gestures that occur simultaneously. Each of the gestures generally can have a particular sequence, motion, or orientation associated therewith. For example, a gesture can include spreading fingers apart or closing fingers together, rotating the fingers, translating the fingers, and/or the like.

Following block 304, the touch-based method 300 proceeds to block 306 where the image object can be modified based on and in unison with the gesture input. By modified, it is meant that the image object can change according to the particular gesture or gestures being performed. By in unison, it is meant that the changes can occur approximately while the gesture or gestures are being performed. In most cases, there is a one-to-one relationship between the gesture(s) and the changes occurring at the image object, and they occur substantially simultaneously. In essence, the image object follows the motion of the fingers. For example, spreading the fingers apart can simultaneously enlarge the object, closing the fingers together can simultaneously reduce the image object, rotating the fingers can simultaneously rotate the object, and translating the fingers can allow simultaneous panning or scrolling of the image object.

In one embodiment, block 306 can include determining which image object is associated with the gesture being performed, and thereafter locking the displayed object to the fingers disposed over it such that the image object changes in accordance with the gestural input. By locking or associating the fingers to the image object, the image object can continuously adjust itself in accordance with what the fingers are doing on the touch screen. Often the determination and locking occur at set down, i.e., when the finger is positioned on the touch screen.

FIG. 12 is a diagram of a zoom gesture method 350 in accordance with one embodiment of the present invention. The zoom gesture can be performed on a multipoint touch screen such as the multi-touch panel 24 shown in FIG. 2. The zoom gesture method 350 generally begins at block 352 where the presence of at least a first finger and a second finger is detected on a touch sensitive surface at the same time. The presence of at least two fingers can be configured to indicate that the touch is a gestural touch rather than a tracking touch based on one finger. In some cases, the presence of only two fingers indicates that the touch is a gestural touch. In other cases, any number of more than two fingers indicates that the touch is a gestural touch. In fact, the gestural touch can be configured to operate whether two, three, four or more fingers are touching, and even if the number changes during the gesture, i.e., only a minimum of two fingers is needed at any time during the gesture.

Following block 352, the zoom gesture method 350 proceeds to block 354 where the distance between at least the two fingers can be compared. The distance can be from finger to finger or from each finger to some other reference point, as for example the centroid. If the distance between the two fingers increases (spread apart), a zoom-in signal can be generated as shown in block 356. If the distance between the two fingers decreases (close together), a zoom-out signal can be generated as shown in block 358. In most cases, the set down of the fingers will associate or lock the fingers to a particular image object being displayed. For example, the touch sensitive surface can be a touch screen, and the image object can be displayed on the touch screen. This typically occurs when at least one of the fingers is positioned over the image object. As a result, when the fingers are moved apart, the zoom-in signal can be used to increase the size of the embedded features in the image object, and when the fingers are pinched together, the zoom-out signal can be used to decrease the size of embedded features in the object. The zooming typically occurs within a predefined boundary such as the periphery of the display, the periphery of a window, the edge of the image object, and/or the like. The embedded features can be formed on a plurality of layers, each of which represents a different level of zoom.

In most cases, the amount of zooming varies according to the distance between the two objects. Furthermore, the zooming typically can occur substantially simultaneously with the motion of the objects. For instance, as the fingers spread apart or close together, the object zooms in or zooms out at the same time. Although this methodology is directed at zooming, it should be noted that it can also be used for enlarging or reducing. The zoom gesture method 350 can be particularly useful in graphical programs such as publishing, photo, and drawing programs. Moreover, zooming can be used to control a peripheral device such as a camera, i.e., when the fingers are spread apart, the camera zooms out, and when the fingers are closed, the camera zooms in.
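
A minimal sketch of blocks 352-358 follows, assuming fingers are reported as (x, y) points and that a small threshold suppresses jitter; the threshold value and the signal names are illustrative only, not prescribed by this disclosure.

```python
# Compare the inter-finger distance between frames and emit zoom signals.
import math

def distance(p1, p2):
    return math.hypot(p1[0] - p2[0], p1[1] - p2[1])

def zoom_signal(prev_fingers, curr_fingers, threshold=2.0):
    """prev_fingers/curr_fingers: ((x1, y1), (x2, y2)) for the same two IDs."""
    d_prev = distance(*prev_fingers)
    d_curr = distance(*curr_fingers)
    if d_curr - d_prev > threshold:
        return "zoom_in"      # block 356: fingers spreading apart
    if d_prev - d_curr > threshold:
        return "zoom_out"     # block 358: fingers closing together
    return None               # no significant change

# Example: fingers spread from 80 px to 120 px apart -> "zoom_in"
print(zoom_signal(((0, 0), (80, 0)), ((0, 0), (120, 0))))
```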

FIGS. 13A-13H illustrate a zooming sequence using the method described above. FIG. 13A illustrates a display presenting an image object 364 in the form of a map of North America with embedded levels which can be zoomed. In some cases, as shown, the image object is positioned inside a window that forms a boundary of the image object 364. FIG. 13B illustrates a user positioning their fingers 366 over a region of North America 368, particularly the United States 370 and more particularly California 372. In order to zoom in on California 372, the user starts to spread their fingers 366 apart as shown in FIG. 13C. As the fingers 366 spread apart further (detected distance increases), the map zooms in further on Northern California 374, then to a particular region of Northern California 374, then to the Bay area 376, then to the peninsula 378 (e.g., the area between San Francisco and San Jose), and then to the city of San Carlos 380 located between San Francisco and San Jose, as illustrated in FIGS. 13D-13H. In order to zoom out of San Carlos 380 and back to North America 368, the fingers 366 can be closed back together following the sequence described above, but in reverse.

FIG. 14 is a diagram of a pan method 400 in accordance with one embodiment of the present invention. The pan gesture can be performed on a multipoint touch screen. The pan method 400 generally begins at block 402 where the presence of at least a first object and a second object is detected on a touch sensitive surface at the same time. The presence of at least two fingers can be configured to indicate that the touch is a gestural touch rather than a tracking touch based on one finger. In some cases, the presence of only two fingers indicates that the touch is a gestural touch. In other cases, any number of more than two fingers indicates that the touch is a gestural touch. In fact, the gestural touch can be configured to operate whether two, three, four or more fingers are touching, and even if the number changes during the gesture, i.e., only a minimum of two fingers is needed.

Following block 402, the pan method 400 proceeds to block 404 where the position of the two objects can be monitored as the objects are moved together across the touch screen. Following block 404, the pan method 400 proceeds to block 406 where a pan signal can be generated when the position of the two objects changes relative to an initial position. In most cases, the set down of the fingers can lock the fingers to a particular image object displayed on the touch screen. As a result, when the fingers are moved together across the touch screen, the pan signal can be used to translate the image in the direction of the fingers. In most cases, the amount of panning varies according to the distance the two objects move. Furthermore, the panning typically can occur substantially simultaneously with the motion of the objects. For instance, as the fingers move, the object moves with the fingers at the same time.
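
Blocks 402-406 can be sketched by tracking the centroid of the touching fingers; the function names and the (dx, dy) signal format are assumptions rather than the patent's data structures.

```python
# Emit a pan signal proportional to the displacement of the finger centroid.
def centroid(points):
    xs, ys = zip(*points)
    return (sum(xs) / len(xs), sum(ys) / len(ys))

def pan_signal(initial_points, current_points):
    """Returns (dx, dy) to translate the locked image object, or None."""
    x0, y0 = centroid(initial_points)
    x1, y1 = centroid(current_points)
    dx, dy = x1 - x0, y1 - y0
    return (dx, dy) if (dx, dy) != (0, 0) else None

# Example: both fingers slide 10 px right and 5 px up (screen y grows down)
print(pan_signal([(0, 0), (50, 0)], [(10, -5), (60, -5)]))   # (10.0, -5.0)
```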

FIGS. 15A-15D illustrate a panning sequence based on the pan method 400 described above. Using the map of FIG. 13A, FIG. 15A illustrates a user positioning their fingers 366 over the map. Upon set down, the fingers 366 are locked to the map. As shown in FIG. 15B, when the fingers 366 are moved vertically up, the entire map 364 can be moved up, thereby causing previously seen portions of the map 364 to be placed outside the viewing area and unseen portions of the map 364 to be placed inside the viewing area. As shown in FIG. 15C, when the fingers 366 are moved horizontally sideways, the entire map 364 can be moved sideways, thereby causing previously seen portions of the map 364 to be placed outside the viewing area and unseen portions of the map to be placed inside the viewing area. As shown in FIG. 15D, when the fingers 366 are moved diagonally, the entire map 364 can be moved diagonally, thereby causing previously seen portions of the map 364 to be placed outside the viewing area and unseen portions of the map to be placed inside the viewing area. As should be appreciated, the motion of the map 364 follows the motion of the fingers 366. This process is similar to sliding a piece of paper along a table: the pressure the fingers exert on the paper locks the paper to the fingers, and when the fingers are slid across the table, the piece of paper moves with them.

FIG. 16 is a diagram of a rotate method 450 in accordance with one embodiment of the present invention. The rotate gesture can be performed on a multipoint touch screen. The rotate method 450 generally begins at block 452 where the presence of a first object and a second object can be detected at the same time. The presence of at least two fingers can be configured to indicate that the touch is a gestural touch rather than a tracking touch based on one finger. In some cases, the presence of only two fingers indicates that the touch can be a gestural touch. In other cases, any number of more than two fingers indicates that the touch can be a gestural touch. In still other instances, the gestural touch can be configured to operate whether two, three, four or more fingers are touching, and even if the number changes during the gesture, i.e., only a minimum of two fingers is needed.

Following block 452, the rotate method 450 proceeds to block 454 where the angle of each of the fingers can be set. The angles can typically be determined relative to a reference point. Following block 454, the rotate method 450 proceeds to block 456 where a rotate signal can be generated when the angle of at least one of the objects changes relative to the reference point. In most cases, the set down of the fingers can associate or lock the fingers to a particular image object displayed on the touch screen. Typically, when at least one of the fingers is positioned over the image object, the image object can be associated with or locked to the fingers. As a result, when the fingers are rotated, the rotate signal can be used to rotate the object in the direction of finger rotation (e.g., clockwise, counterclockwise). In most cases, the amount of object rotation varies according to the amount of finger rotation, i.e., if the fingers move 5 degrees then so will the object. Furthermore, the rotation typically can occur substantially simultaneously with the motion of the fingers. For instance, as the fingers rotate, the object rotates with the fingers at the same time.
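
Blocks 452-456 can be sketched by measuring each finger's angle about a reference point, here taken (as one option the text allows) to be the centroid of the finger pair; the wrap-around arithmetic and example values are illustrative.

```python
# Emit the change in a finger's angle about the reference point as a rotate
# signal, in degrees.
import math

def finger_angle(finger, reference):
    """Angle of a finger about a reference point, in degrees."""
    return math.degrees(math.atan2(finger[1] - reference[1],
                                   finger[0] - reference[0]))

def rotate_signal(prev_fingers, curr_fingers):
    """Angular change of the first finger about the pair's prior centroid."""
    ref = ((prev_fingers[0][0] + prev_fingers[1][0]) / 2,
           (prev_fingers[0][1] + prev_fingers[1][1]) / 2)
    delta = (finger_angle(curr_fingers[0], ref)
             - finger_angle(prev_fingers[0], ref))
    return (delta + 180) % 360 - 180   # wrap into (-180, 180]

# Example: one finger sweeps about 5 degrees around the centroid -> ~5.0
print(rotate_signal(((10, 0), (-10, 0)), ((9.96, 0.87), (-10, 0))))
```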

FIGS. 17A-17C illustrate a rotating sequence based on the method described above. Using the map of FIG. 13A, FIG. 17A illustrates a user positioning their fingers 366 over the map 364. Upon set down, the fingers 366 are locked to the map 364. As shown in FIG. 17B, when the fingers 366 are rotated in a clockwise direction, the entire map 364 is rotated in the clockwise direction in accordance with the rotating fingers 366. As shown in FIG. 17C, when the fingers 366 are rotated in a counterclockwise direction, the entire map 364 is rotated in the counterclockwise direction in accordance with the rotating fingers 366.

It should be noted that the methods described in FIGS. 12-17 can be implemented during the same gestural stroke. That is, zooming, rotating and panning can all be performed during the gestural stroke, which can include spreading, rotating and sliding fingers. For example, upon set down with at least two fingers, the displayed object (map) can be associated or locked to the two fingers. In order to zoom, the user can spread or close their fingers. In order to rotate, the user can rotate their fingers. In order to pan, the user can slide their fingers. Each of these actions can occur simultaneously in a continuous motion. For example, the user can spread and close their fingers while rotating and sliding them across the touch screen. Alternatively, the user can segment each of these motions without having to reset the gestural stroke. For example, the user can first spread their fingers, then rotate their fingers, then close their fingers, then slide their fingers, and so on.
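
One way to realize this combined behavior is to derive all three signals from the same pair of frames, as in the following self-contained sketch; the thresholds and the return format are assumptions, not the patent's implementation.

```python
# Extract zoom, pan, and rotate components from one two-finger stroke update.
import math

def stroke_update(prev, curr, eps=2.0):
    """prev/curr: ((x1, y1), (x2, y2)) samples for the same two finger IDs."""
    dist = lambda a, b: math.hypot(a[0] - b[0], a[1] - b[1])
    cen = lambda pts: (sum(p[0] for p in pts) / 2, sum(p[1] for p in pts) / 2)
    ang = lambda p, r: math.degrees(math.atan2(p[1] - r[1], p[0] - r[0]))
    spread = dist(*curr) - dist(*prev)           # zoom component
    c0, c1 = cen(prev), cen(curr)                # pan component
    rot = (ang(curr[0], c0) - ang(prev[0], c0) + 180) % 360 - 180
    return {
        "zoom": "in" if spread > eps else "out" if spread < -eps else None,
        "pan": (c1[0] - c0[0], c1[1] - c0[1]),
        "rotate": round(rot, 1),   # degrees of angular change
    }

# Fingers spread, slide right, and twist slightly in one continuous motion:
print(stroke_update(((0, 0), (80, 0)), ((5, 2), (125, 10))))
```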

Further details on the detection, interpretation and use of gestures are described in Applicant's co-pending U.S. patent application Ser. No. 11/038,590 entitled "Mode-Based Graphical User Interfaces for Touch Sensitive Input Devices," filed Jan. 18, 2005, and published as U.S. Patent Publication No. 2006/0026535 on Feb. 2, 2006, the contents of which are incorporated by reference herein. Further details on detecting touches on a surface and interpreting them as gestures are described in Applicant's co-pending U.S. patent application Ser. No. 11/428,522 entitled "Identifying Contacts on a Touch Surface," filed Jul. 3, 2006, and published as U.S. Patent Publication No. 2006/0238522 on Oct. 26, 2006, the contents of which are incorporated by reference herein. Further details on "knuckle" and "pengrip" detection are described in U.S. Pat. No. 6,323,846 entitled "Method and Apparatus for Integrating Manual Input," the contents of which are incorporated by reference herein.

"Real-world" gestures such as hand or finger movements/orientations that are generally recognized to mean certain things (e.g., an "OK" hand signal generally indicates an affirmative response) can be detected by the touch or hover sensitive device and interpreted by the multipoint processing method to more efficiently and accurately effect intended operations. Like other gestures described above, "real-world" gestures can be detected and interpreted by first obtaining one or more images of touch, classifying and grouping the features, calculating any key parameters of the groups, associating the groups with UI elements where applicable, and then performing the appropriate action.
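
That pipeline can be sketched end to end as follows; the Feature type, the magnitude threshold, and the rectangle-based hit test are all illustrative assumptions rather than this disclosure's implementation.

```python
# Images of touch -> features -> group parameters -> associated UI element.
from dataclasses import dataclass

@dataclass
class Feature:
    x: float
    y: float
    magnitude: float   # z-component: touch is strong, hover is weak

def classify(feature):
    return "touch" if feature.magnitude > 0.5 else "hover"

def group_parameters(features):
    """Key parameters of a group, e.g., its approximate center."""
    cx = sum(f.x for f in features) / len(features)
    cy = sum(f.y for f in features) / len(features)
    return {"center": (cx, cy)}

def associate_ui_element(center, ui_elements):
    """Return the UI element whose bounds contain the group center, if any."""
    for name, (x0, y0, x1, y1) in ui_elements.items():
        if x0 <= center[0] <= x1 and y0 <= center[1] <= y1:
            return name
    return None

features = [Feature(10, 10, 0.9), Feature(14, 12, 0.2)]
params = group_parameters(features)
print(associate_ui_element(params["center"], {"dialog": (0, 0, 20, 20)}))
```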

FIGS. 18a-18c illustrate an exemplary "OK gesture" according to embodiments of this invention. An "OK gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring an affirmative response, such as a text box UI element that asks the user whether a certain action is to be taken (e.g., "Are you sure you want to delete this file?").

To detect an "OK gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 18a illustrates an exemplary orientation of a right hand 1800 giving an "OK gesture" over a UI element 1810 appearing beneath sensor panel 1802 according to embodiments of this invention. In some embodiments, the thumb and index finger in the "OK" gesture can touch the sensor panel. Note that although single-hand gestures will generally be described and illustrated herein as right-handed operations, it should be understood that embodiments of this invention are equally applicable to left-handed operations.

In FIG. 18b, the image of actual touch 1804 may not look like an "OK gesture" at all, because only the user's thumb, index finger, "pinky" finger and palm edge may actually make contact with the sensor panel. However, if a sensor panel with sufficient near-field and/or far-field sensitivity is utilized, the image can appear more like that shown in FIG. 18c, with some or all of the index finger and thumb 1806 being detected, although portion 1804 can have a much higher "magnitude" or z-component as compared to portion 1806, because portion 1804 is closer to the sensor panel. In some embodiments, image 1804 can be detected by a touch sensor panel, while image 1806 can be detected by a co-located proximity sensor panel. Images 1804 and 1806 can together be converted into a single feature as described above, or the images can be converted into two separate but overlapping features if image magnitude values are used to separate the two features.

If a single feature is found, and the various characteristics indicative of an "OK gesture" are present (e.g., either images 1804 of FIG. 18b, or a "finger circle" portion including an image of weak touch or hover 1806 that completely or almost completely encircles a region of no detected touch or hover 1808, in combination with one or more adjacent longer and tapering "palm edge and pinky" images 1804), then that feature can be classified as an "OK gesture." If two features are found, one can be classified as a "palm edge and pinky" feature and the other can be classified as a "thumb/index finger" feature or a "finger circle" feature, and both features can be grouped together. The grouping can then be analyzed to determine if the grouping indicates that a hand forming an "OK gesture" is present. Note that the presence of other images in the same general area, or the absence of certain image portions (e.g., the image portion caused by the pinky finger), can cause a lack of recognition of the "OK gesture."

After the features have been classified and grouped, parameters for the group can be computed, such as an approximate center 1812 of the region of no detected touch or hover, and if this center is located coincident with the UI element (e.g., text box 1810 in FIG. 18a), the UI element can be associated with the group. An appropriate action can then be taken. In the present example, the detection of an "OK gesture" image coincident with the "Are you sure you want to delete this file?" text box can cause the file to be deleted.
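
As a rough illustration of this parameter step, the center 1812 of the encircled region can be approximated by the centroid of the ring of weak hover points, then tested against the dialog's bounds. A real detector would also verify the ring closure and the adjacent "palm edge and pinky" feature; this sketch and its names are illustrative only.

```python
# Approximate the center of the no-touch region inside the "finger circle"
# and hit-test it against the confirmation box.
def ring_center(ring_points):
    n = len(ring_points)
    return (sum(x for x, _ in ring_points) / n,
            sum(y for _, y in ring_points) / n)

def confirms_dialog(ring_points, dialog_bounds):
    """dialog_bounds: (x0, y0, x1, y1) of e.g. the 'delete this file?' box."""
    cx, cy = ring_center(ring_points)
    x0, y0, x1, y1 = dialog_bounds
    return x0 <= cx <= x1 and y0 <= cy <= y1

# Ring of hover samples around (50, 50), dialog occupying that area:
ring = [(45, 50), (55, 50), (50, 45), (50, 55)]
print(confirms_dialog(ring, (40, 40, 60, 60)))   # True -> delete the file
```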

FIGS. 19a-19e illustrate an exemplary "grasp everything gesture" according to embodiments of this invention. A "grasp everything gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page.

To detect a "grasp everything gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIGS. 19a and 19b illustrate a right hand 1900 performing a "grasp everything gesture" over a plurality of UI elements 1902 appearing beneath sensor panel 1904 according to embodiments of this invention. In FIG. 19a, right hand 1900 can have a palm resting on the sensor panel with fingers extended and touching the sensor panel at the approximate circumferential or oblong boundary 1906 of an area within which UI elements 1902 can be grouped. FIG. 19b shows the image of touch corresponding to FIG. 19a, including palm heel image 1916 and fingerprint images 1908. In FIG. 19b, images 1916 and 1908 can be classified as palm and fingerprint features, respectively, and can be grouped together if their spacings and orientations indicate that they are from the same hand, for example. In some embodiments, the multipoint processing method can receive one or more inputs that cause the multipoint processing method to enter a certain mode and begin to convert the received images 1908 into features, and classify, group, and interpret the images as the start of a possible "grasp everything gesture." In other embodiments, the detection of images 1916 and 1908 spaced apart at distances indicating the presence of a single spread hand can trigger the multipoint processing method to begin tracking these images as the start of a possible "grasp everything gesture." In still other embodiments, the context of images 1908, such as if they should appear over a desktop, can cause the images to be classified, grouped, and interpreted as the start of a possible "grasp everything gesture." In any case, parameters for the group can then be computed, such as circumferential boundary 1906.

FIG. 19c shows right hand 1900 in a clenched fist shape 1910 after the "grasp everything gesture" has been completed. In corresponding FIG. 19d, after completion of the gesture, the detected image can now include a palm image 1916 and images 1918, which can represent the light touches of fingerprints or first knuckles resting on the sensor panel. However, between the images of FIGS. 19b and 19d, as the user's fingers were being drawn together into a clenched fist, temporary images 1920 may have been detected. These temporary images 1920 can be stored and used to more accurately determine that a "grasp everything gesture" has been performed. In other words, to enhance the robustness of the detection of the "grasp everything gesture," temporary images 1920 can be tracked over time and detected as being pulled inward for some distance, with a gradually lighter "touch," before disappearing, to further ensure the detection of fingers being pulled inward and upward toward the palm.

If the images and corresponding features are tracked over time, the completion of a "grasp everything gesture" can be accurately detected. At that point, all UI elements 1902 within the calculated circumferential boundary 1906 can be associated with the "grasp everything gesture" and selected together, for example. Any UI elements outside this boundary would not be "grasped." Note, however, that if after some time has passed, instead of the images of FIG. 19d, no image of touch is detected, this can indicate that the user has merely lifted the hand off of the sensor panel without performing a "grasp everything gesture," and in that case the "grasp everything gesture" would not be detected.
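
The inward-and-fading track check described above might be sketched as follows; the track format, the thresholds, and the circular boundary test are assumptions for illustration.

```python
# Fingertip tracks should move toward the palm while their magnitudes fade
# (temporary images 1920); elements inside the initial boundary are selected.
import math

def is_grasp(track, palm, min_travel=20.0):
    """track: [(x, y, magnitude), ...] samples for one fingertip over time."""
    d = lambda p: math.hypot(p[0] - palm[0], p[1] - palm[1])
    moved_inward = d(track[0]) - d(track[-1]) >= min_travel
    faded = all(b[2] <= a[2] for a, b in zip(track, track[1:]))
    return moved_inward and faded

def grasped_elements(finger_tracks, palm, elements, radius):
    """Select elements within the initial circumferential boundary."""
    if not all(is_grasp(t, palm) for t in finger_tracks):
        return []
    return [name for name, (x, y) in elements.items()
            if math.hypot(x - palm[0], y - palm[1]) <= radius]

tracks = [[(100, 0, 0.9), (60, 0, 0.5), (30, 0, 0.2)],
          [(0, 100, 0.8), (0, 55, 0.4), (0, 25, 0.1)]]
print(grasped_elements(tracks, (0, 0), {"icon": (40, 10)}, radius=100))
```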

After the "grasp everything gesture" has been detected and the corresponding UI elements have been grouped, the user can utilize various input mechanisms to take further actions with regard to the grasped UI elements. In an exemplary extension of the "grasp everything gesture," if images 1916 and 1918 are further tracked over time and rapidly get weaker and quickly disappear, indicating that the user may have quickly pulled the clenched hand away from the touch screen, this can be further interpreted as a "throw away" gesture to discard the grasped UI elements (i.e., a "grasp and delete everything gesture").

FIGS. 20a-20b illustrate an exemplary "stamp of approval gesture" according to embodiments of this invention. A "stamp of approval gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring an affirmative response, such as a UI element including text that asks the user whether a certain action is to be taken (e.g., "Do you accept all highlighted changes?").

To detect a "stamp of approval gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 20a illustrates an exemplary orientation of a right hand 2000 giving a "stamp of approval gesture" over a UI element 2002 appearing beneath sensor panel 2004 according to embodiments of this invention. In FIG. 20b, the image of actual touch 2006 may not look like a "stamp of approval gesture" at all, because only the user's curled "pinky" finger and palm edge can actually make contact with the sensor panel. To enhance the unambiguous detection of a "stamp of approval gesture," image 2006 can be required to appear and disappear rapidly, which would indicate a characteristic stamping motion by the user.

After the detected image 2006 has been converted to one or more features and classified and grouped as a "stamp of approval gesture," parameters for the feature can be computed, such as an approximate center 2008 of the "stamp of approval gesture," and if this center is located coincident with the UI element (e.g., text box 2002 in FIG. 20a), the UI element can be associated with the gesture and appropriate action can then be taken. In the present example, the detection of a "stamp of approval gesture" coincident with the "Do you accept all highlighted changes?" text box can cause the changes to become permanent.
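
The appear-and-disappear timing requirement could be checked with something as simple as the following; the 300 ms window is an assumed value, not one given in this disclosure.

```python
# A brief presence of the hand-edge image indicates a stamping motion.
def is_stamp(appear_time_s, disappear_time_s, max_duration_s=0.3):
    """True if the touch image was present only briefly, like a stamp."""
    return 0.0 < (disappear_time_s - appear_time_s) <= max_duration_s

print(is_stamp(1.00, 1.20))   # True: brief contact, treat as a stamp
print(is_stamp(1.00, 2.50))   # False: a resting hand edge, not a stamp
```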

FIGS. 21a-21b illustrate an exemplary "circle select gesture" according to embodiments of this invention. A "circle select gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring a grouping action, such as the grouping of UI elements such as file icons or graphics symbols on a desktop or drawing page.

To detect a "circle select gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 21a illustrates a right hand 2100 giving a "circle select gesture" over a plurality of UI elements 2102 appearing beneath sensor panel 2104 according to embodiments of this invention. In FIG. 21a, right hand 2100 can have only the index finger extended and touching the sensor panel, although it should be understood that other fingers can be used as well. Over time, the index finger can trace the approximate boundary 2106 of an area within which UI elements 2102 can be grouped or selected. The boundary can be generally circular, oval, oblong or any shape that eventually closes on itself. FIG. 21b shows the image of touch corresponding to FIG. 21a, including fingerprint images 2108 changing over time. In some embodiments, image processing logic can receive one or more inputs that cause the image processing logic to enter a certain mode and begin to convert the received images 2108 into features, and classify, group, and interpret the features as the start of a possible "circle select gesture." In other embodiments, the context of images 2108, such as if the images should appear over a desktop or graphics application, can cause the images to be converted into features, classified, grouped, and interpreted as the start of a possible "circle select gesture." In either case, parameters for the group can then be computed, such as circumferential boundary 2106.

If a "circle select gesture" is detected, all UI elements 2102 within the circumferential boundary 2106 can be associated with the gesture and selected together, for example. Any UI elements outside this boundary would not be selected. However, if after some time has passed, the moving image of touch 2108 has not yet substantially returned to its starting point, this can indicate that the user has abandoned the "circle select gesture" or did not intend that gesture at all, and in that case no UI elements would be selected.
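
A plausible sketch of the selection step: require the traced path to substantially close on itself, then run a standard ray-casting point-in-polygon test on each element; the closure tolerance is an assumed value.

```python
# Treat the traced fingerprint path as a polygon and select elements inside.
import math

def path_closed(path, tolerance=15.0):
    (x0, y0), (x1, y1) = path[0], path[-1]
    return math.hypot(x1 - x0, y1 - y0) <= tolerance

def inside(point, polygon):
    """Standard ray-casting point-in-polygon test."""
    x, y = point
    hit = False
    for (ax, ay), (bx, by) in zip(polygon, polygon[1:] + polygon[:1]):
        if (ay > y) != (by > y) and x < (bx - ax) * (y - ay) / (by - ay) + ax:
            hit = not hit
    return hit

def circle_select(path, elements):
    if not path_closed(path):
        return []   # abandoned or unintended gesture: select nothing
    return [name for name, p in elements.items() if inside(p, path)]

square = [(0, 0), (100, 0), (100, 100), (0, 100), (2, 3)]
print(circle_select(square, {"icon_a": (50, 50), "icon_b": (200, 50)}))
```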

After the "circle select gesture" has been detected and the corresponding UI elements have been selected, the user can utilize various input mechanisms to take further actions with regard to the selected UI elements. For example, the selected UI elements can be copied, moved, or deleted as a group.

FIGS. 22a-22d illustrate an exemplary "X to delete gesture" according to embodiments of this invention. An "X to delete gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring a deleting action, such as the deleting of a UI element such as a file icon or graphics symbol on a desktop or drawing page.

To detect an "X to delete gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 22a illustrates a right hand 2200 beginning an "X to delete gesture" over a UI element 2202 appearing beneath sensor panel 2204 according to embodiments of this invention. In FIG. 22a, right hand 2200 can have only the index finger extended and touching the sensor panel, although it should be understood that any finger can be used. Over time, the index finger can trace the first stroke 2206 of an "X" over UI element 2202. FIG. 22b shows a series of images of touch corresponding to FIG. 22a, including fingerprint images 2208 changing over time. In some embodiments, image processing logic can receive one or more inputs that cause the image processing logic to enter a certain mode and begin to convert the received images 2208 into features, and classify, group, and interpret the features as the start of a possible "X to delete gesture." In other embodiments, images 2208 can be stored until further information is received that indicates this is the start of a possible "X to delete gesture."

After stroke 2206 is completed, the user can then lift the finger off the sensor panel and begin a second stroke 2210, shown in FIG. 22c. Over time, the index finger can trace the second stroke 2210 of the "X" over UI element 2202. FIG. 22d shows a series of images of touch corresponding to FIG. 22c, including fingerprint images 2212 changing over time. In some embodiments, image processing logic can receive one or more inputs that cause the processing logic to classify, group, and interpret images 2212 as the second stroke of a possible "X to delete gesture." In other embodiments, images 2212 can be stored until stroke 2210 has been completed and it is confirmed that an "X to delete gesture" has been properly received. To ensure an accurate detection of the "X to delete gesture," the multipoint processing method can require that the strokes 2206 and 2210 be completed within a short time of each other.

If an "X to delete gesture" is detected, parameters can be computed such as the intersection between the first and second strokes 2206 and 2210. The UI element 2202 beneath the intersection between the first and second strokes 2206 and 2210 can then be selected for deletion. Those UI elements that did not appear under the intersection would not be deleted. Note that if, for example, after some time has passed, the second stroke 2210 is not detected, this can indicate that the user has abandoned the "X to delete gesture" or that it was never intended to be an "X to delete gesture," and in that case no UI elements would be selected and deleted.
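
The intersection parameter can be computed with standard segment-intersection math, combined with the short-time requirement from the previous paragraph; the 1.5 s window is an assumed value, and each stroke is approximated by the segment between its endpoints.

```python
# Find where the two strokes cross and enforce the timing constraint.
def segment_intersection(p1, p2, p3, p4):
    """Intersection point of segments p1-p2 and p3-p4, or None."""
    (x1, y1), (x2, y2), (x3, y3), (x4, y4) = p1, p2, p3, p4
    den = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    if den == 0:
        return None   # parallel strokes: no "X"
    t = ((x1 - x3) * (y3 - y4) - (y1 - y3) * (x3 - x4)) / den
    u = ((x1 - x3) * (y1 - y2) - (y1 - y3) * (x1 - x2)) / den
    if 0 <= t <= 1 and 0 <= u <= 1:
        return (x1 + t * (x2 - x1), y1 + t * (y2 - y1))
    return None

def x_to_delete(stroke1, stroke2, t1_end, t2_end, max_gap_s=1.5):
    if abs(t2_end - t1_end) > max_gap_s:
        return None   # strokes too far apart in time: not one gesture
    return segment_intersection(stroke1[0], stroke1[-1],
                                stroke2[0], stroke2[-1])

# Two diagonal strokes crossing near (50, 50), one second apart:
print(x_to_delete([(0, 0), (100, 100)], [(0, 100), (100, 0)], 1.0, 2.0))
```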

FIGS. 23a-23b illustrate an exemplary "knock to inquire gesture" according to embodiments of this invention. A "knock to inquire gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring an inquiry action, such as determining whether an application, file, person of interest or other entity is available to be opened, accessed, communicated with, and the like.

To detect a "knock to inquire gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 23a illustrates a right hand 2300 executing a "knock to inquire gesture" over a UI element 2302 appearing beneath sensor panel 2304 according to embodiments of this invention. Although not visible in FIG. 23a, right hand 2300 can have only one to four knuckles actually contacting the sensor panel at any time. FIG. 23b shows exemplary images of touch 2306 corresponding to knuckles contacting the sensor panel in FIG. 23a. These images can be converted to features, and grouped and classified as knuckle features. Although not shown in FIG. 23b, images 2306 can be generally repeated two or more times, representing a typical knocking motion. In some embodiments, image processing logic can receive one or more inputs that cause the processing logic to classify, group, and interpret images 2306 as the start of a possible "knock to inquire gesture." In other embodiments, images 2306 can be stored until further information is received that indicates this is the start of a possible "knock to inquire gesture." For example, after two or more groupings of images 2306 are received in rapid succession, the image processing logic can interpret these repetitively received, similarly located groups of images 2306 as a "knock to inquire gesture."

If a "knock to inquire gesture" is detected, parameters can be computed such as the approximate center location 2308 of the knuckle on the middle finger. The UI element 2302 beneath center location 2308 can then be associated with the gesture and selected for inquiry. Those UI elements that did not appear under center location 2308 would not be selected. Note that if, for example, after a first group of images 2306 is detected, similar groups of images are not received in rapid succession, this can indicate that a "knock to inquire gesture" is not being performed.
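
The rapid-succession requirement might be checked as follows; the interval and offset thresholds are assumed values chosen only to make the sketch concrete.

```python
# Two or more knuckle-group appearances, close in time and location, count
# as a knock.
import math

def is_knock(events, max_interval_s=0.5, max_offset=20.0, min_knocks=2):
    """events: [(time_s, (x, y)), ...] one entry per knuckle-group touch."""
    if len(events) < min_knocks:
        return False
    for (t0, p0), (t1, p1) in zip(events, events[1:]):
        if t1 - t0 > max_interval_s:
            return False   # too slow: separate touches, not a knock
        if math.hypot(p1[0] - p0[0], p1[1] - p0[1]) > max_offset:
            return False   # moved too far: not the same knock location
    return True

print(is_knock([(0.0, (100, 80)), (0.3, (103, 82))]))   # True
print(is_knock([(0.0, (100, 80)), (2.0, (103, 82))]))   # False: too slow
```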

After the "knock to inquire gesture" has been detected and the corresponding UI element has been selected, further actions can be taken with regard to the selected UI element. For example, if the selected UI element is a shared resource such as a limited access application, an inquiry as to whether the application is available to be checked out can be initiated.

FIGS. 24a-24h illustrate an exemplary "hitchhiker gesture" according to embodiments of this invention. A "hitchhiker gesture" formed by either hand over a touch or hover sensitive device can be detected and interpreted to perform operations requiring a directional input, such as scrolling, panning, windowing, translating in time (e.g., moving backward or forward in time in an audio or video recording), and the like.

To detect a "hitchhiker gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 24a illustrates a right hand 2400 executing a "hitchhiker gesture" upon sensor panel 2404 according to embodiments of this invention. Although not visible in FIG. 24a, right hand 2400 can have only certain hand parts actually contacting the sensor panel at any time. FIG. 24b shows exemplary images of touch 2406 corresponding to the "hitchhiker gesture" in FIG. 24a. These images 2406 can be converted to features, and grouped and classified as a left-pointing "hitchhiker gesture." In particular, one of the images 2406 can be identified as corresponding to a thumb, and from that identification a directional parameter (e.g., left) can be calculated. However, if images 2406 are insufficient to detect the left-pointing "hitchhiker gesture" with reasonable accuracy, in some embodiments the near-field proximity detection capabilities of the touch sensors or a co-located proximity sensor panel can be used to detect an outline 2408 of the left-pointing "hitchhiker gesture" to more accurately identify it. Detection of the left-pointing "hitchhiker gesture" can then be used to scroll left, pan left, window left, translate left (earlier) in time (e.g., moving backward in an audio or video recording), and the like.

FIGS. 24c and 24d, FIGS. 24e and 24f, and FIGS. 24g and 24h illustrate other orientations and detected touch and/or hover images for the "hitchhiker gesture" pointing in different directions. FIGS. 24d and 24h, in particular, show how only the user's middle knuckle can form a touch image on sensor panel 2404, and therefore illustrate how outline 2408, detected by the near-field proximity detection capabilities of the touch sensors or proximity sensors co-located with the touch sensors, can be used advantageously to more accurately detect the "hitchhiker gesture."
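
One heuristic for the directional parameter, assuming the thumb image is the one farthest from the cluster's centroid (an assumption for illustration, not this disclosure's stated method), is sketched below.

```python
# Derive the pointing direction from the offset of the presumed thumb image.
import math

def hitchhiker_direction(points):
    """points: (x, y) centers of the touch/hover images of one hand."""
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    thumb = max(points, key=lambda p: math.hypot(p[0] - cx, p[1] - cy))
    dx, dy = thumb[0] - cx, thumb[1] - cy
    if abs(dx) >= abs(dy):
        return "left" if dx < 0 else "right"
    return "up" if dy < 0 else "down"      # screen y grows downward

# Fist images clustered near x ~ 100 with a thumb image far to the left:
print(hitchhiker_direction([(100, 50), (105, 60), (98, 70), (30, 55)]))
```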

FIGS. 25a-25h illustrate exemplary "shape gestures" according to embodiments of this invention. Shape gestures formed by either or both hands over a touch or hover sensitive device can be detected and interpreted to perform various operations, including creating a graphic image of that shape.

To detect a shape gesture, either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 25a illustrates both hands 2500 making a "triangle shape gesture" over sensor panel 2502 according to embodiments of this invention. FIG. 25b shows the images of actual touch or hover 2504 that can be detected by the sensor panel. FIG. 25c illustrates both hands 2500 making a "V shape gesture" over sensor panel 2502 according to embodiments of this invention. FIG. 25d shows the images of actual touch 2506 or hover 2512 that can be detected by the sensor panel. FIG. 25e illustrates both hands 2500 making an "inverted V shape gesture" over sensor panel 2502 according to embodiments of this invention. FIG. 25f shows the images of actual touch 2508 or hover 2516 that can be detected by the sensor panel. FIG. 25g illustrates both hands 2500 making a "wing shape gesture" over sensor panel 2502 according to embodiments of this invention. FIG. 25h shows the images of actual touch 2510 or hover 2514 that can be detected by the sensor panel.

In any of these examples, the images can be converted to features, which can then be classified, grouped and interpreted as a particular gesture. Each gesture described above can then cause a particular operation to be performed with regard to UI elements appearing with a certain relationship to the gesture (e.g., UI elements within the triangle formed by the "triangle shape gesture" can be selected, zoomed, and the like). In another example, if multiple touch images taken over time indicate that the "V shape gesture" is being shifted upwards, UI elements within the "V" of the "V shape gesture" could be grouped and translated (moved) upward in accordance with the motion of the "V shape gesture." A similar concept could apply to the "inverted V shape gesture." The "wing shape gesture" could, for example, trigger the desktop faxing or sending of a UI element under the gesture (e.g., a document) or trigger the e-mailing of a document over the Internet.

FIGS. 26a-26d illustrate exemplary "identification gestures" according to embodiments of this invention. An "identification gesture" formed by either or both hands on or over a touch or hover sensitive device can be detected and interpreted to perform operations requiring an identification action, such as determining whether the user is to get access, or continue to have access, to a restricted account, file or other item.

To detect an "identification gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIG. 26a illustrates a right hand 2600 beginning an exemplary "identification gesture" over a UI text box element 2608 indicating "Please enter graphical/gestural password" appearing on sensor panel 2602 according to embodiments of this invention. In FIG. 26a, right hand 2600 can have only the index finger extended and touching the sensor panel, although it should be understood that any one finger or multiple fingers, or even both hands, can be used. Over time, the index finger can trace a graphical password 2604. FIG. 26b shows a series of captured images of touch corresponding to FIG. 26a, including fingerprint images 2606 changing over time. In some embodiments, image processing logic can receive one or more inputs that cause the image processing logic to enter a certain mode and begin to convert the received images into features, and classify, group, and interpret images 2606 as the start of a possible "identification gesture" as soon as images start to appear. In other embodiments, images 2606 can be stored until further information is received that identifies the images as an "identification gesture." For example, if the complete "identification gesture" is the shape shown in FIG. 26b, the composite shape can be recognized as an "identification gesture" only after all images 2606 are fully captured. Note that although the example of FIGS. 26a and 26b only shows a single stroke by a single hand, it should be understood that one or more strokes, by either or both hands, in series or in parallel, can be used to create an "identification gesture" unique to the user. The gesture need not look like a signature, but can be any symbolic representation of choice.

A time component can optionally be attached to an "identification gesture." For example, the entire "identification gesture" can be required to be completed within a certain period of time (e.g., a few seconds), otherwise the gesture will be rejected.
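
Matching a traced identification gesture against a stored template, with the optional time component enforced, might look like the following; the resampling scheme and all thresholds are assumptions, and a production system would use a stronger matcher.

```python
# Compare a traced path to a stored template and enforce the time limit.
import math

def resample(path, n=16):
    """Pick n roughly evenly spaced samples along the recorded path."""
    step = max(1, (len(path) - 1) // (n - 1))
    pts = path[::step][:n]
    return pts + [path[-1]] * (n - len(pts))   # pad with the final point

def matches_template(path, template, duration_s,
                     max_mean_dist=25.0, max_duration_s=3.0):
    if duration_s > max_duration_s:
        return False   # optional time component: too slow, reject
    a, b = resample(path), resample(template)
    mean = sum(math.hypot(p[0] - q[0], p[1] - q[1])
               for p, q in zip(a, b)) / len(a)
    return mean <= max_mean_dist

template = [(0, 0), (20, 40), (40, 0), (60, 40)]
trace = [(2, 3), (21, 38), (43, 2), (58, 41)]
print(matches_template(trace, template, duration_s=1.8))   # True
```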

In another embodiment illustrated in FIG. 26c, one or more hands 2600 can be used to form an "identification gesture" that need not touch the sensor panel, or may only partially touch the sensor panel. In the example of FIG. 26c, the user's right hand can contact the sensor panel, resulting in one or more vertical images of touch 2610 as shown in FIG. 26d. In addition, the user's left hand can only hover over the sensor panel 2602, resulting in an image of hover 2612 with a characteristic hole 2614 in it. In some embodiments, image processing logic can receive one or more inputs that cause the image processing logic to enter a certain mode and begin to convert the received images into features, and classify, group, and interpret images 2610 and 2612 as the start of a possible "identification gesture" as soon as images start to appear. In other embodiments, images 2610 and 2612 can be stored until further information is received that identifies the images as an "identification gesture." For example, if the complete "identification gesture" is the images shown in FIG. 26d, the composite shape can be recognized as an "identification gesture" only after all images 2610 and 2612 are fully captured. It should be understood that FIGS. 26a-26d are only exemplary in nature, and that a virtually unlimited number of gestures could be used as identification gestures.

In another embodiment illustrated in FIG. 26e, the thumb and another finger of hands 2600 can be placed together near a center of the touch sensor panel, resulting in the image of touch 2616 as shown in FIG. 26f, and then spread towards the four corners of the touch sensor panel, generating intermediate images 2618. Recognition of this "unlock gesture" can unlock an object or application. Alternatively, as shown in FIG. 26g, the thumb and another finger of hands 2600 can be placed towards the four corners of the touch sensor panel, resulting in the image of touch 2620 as shown in FIG. 26h, and then moved towards the center of the touch sensor panel, generating intermediate images 2622. Recognition of this "lock gesture" can lock an object or application.

FIGS. 27a-27k illustrate an exemplary "hand edge gesture" according to embodiments of this invention. A "hand edge gesture" can be formed by initially placing the edge of either hand (palm edge plus pinky finger edge) over a touch or hover sensitive device, with the thumb pointing upwards away from the sensor panel or alongside the index finger. With the thumb pointing toward the ceiling (or along the surface normal vector), the wrist can advantageously have a much wider range of motion for rotation/orientation than in any other hand arrangement (including rotation with five fingertips, or a hand flat on the surface). The hand edge gesture, optionally along with various sweeping motions, can be detected and interpreted to perform operations such as quickly orienting an application or control along the elongated axis defined by the detected hand edge, changing to another desktop, sweeping away applications, icons, or files, performing undo or redo operations, and the like.

To detect a "hand edge gesture," either a touch sensor panel with some hover detecting capability or a touch sensor panel co-located with a proximity sensor panel can be employed. FIGS. 27a and 27b illustrate a right hand 2700 beginning a general "hand edge gesture" over one or more UI elements 2710 appearing beneath sensor panel 2702 according to embodiments of this invention. In FIG. 27b, the image of actual touch 2704 may not look like a "hand edge gesture" at all, because only the user's "pinky" finger and palm edge may actually make contact with the sensor panel. Images 2704 can together be converted into a single feature as described above.

FIG. 27c shows the image of touch 2704 as the right hand edge is swept towards the user's body, along with intermediate touch images 2706 and 2708, although it should be understood that a number of intermediate touch images can be generated as the right hand edge is swept. FIG. 27d shows an alternative in which the right hand is initially placed down in a horizontal position, creating touch image 2710, and then swept away from the user, creating intermediate touch images 2712 and 2714. FIG. 27e shows another alternative in which the right hand is initially placed down in a horizontal position, creating touch image 2716, and then moved away from the user while maintaining the horizontal orientation, creating intermediate touch images 2718 and 2720. FIG. 27f shows yet another alternative in which the right hand is initially placed down in a horizontal position, creating touch image 2722, and then moved toward the user while maintaining the horizontal orientation, creating intermediate touch images 2724 and 2726. FIG. 27g shows yet another alternative in which the right hand is initially placed down in a vertical position, creating touch image 2728, and then moved to the right while maintaining the vertical orientation, creating intermediate touch images 2730 and 2732. FIG. 27h shows yet another alternative in which the right hand is initially placed down in a vertical position, creating touch image 2734, and then moved to the left while maintaining the vertical orientation, creating intermediate touch images 2736 and 2738. FIG. 27i shows yet another alternative in which the right hand is initially placed down in a vertical position, creating touch image 2740, and then moved up and down in a "sawing" motion while maintaining the vertical orientation, creating intermediate touch images 2742 (shown time-elapsed at times T1-T5 and visually separated for clarity; note that there need be no actual horizontal motion in the gesture). FIG. 27j shows yet another alternative in which the right hand is initially placed down in a vertical position, creating touch image 2744, and then only the pinky and one or more of the other non-thumb fingers are partially curled or brought perpendicular to the thumb, creating intermediate touch images 2746 and 2748. FIG. 27k shows yet another alternative in which the right hand is initially placed down with the palm in a vertical position and fingers partially curled or perpendicular to the thumb, creating touch image 2750, and then the pinky and one or more of the other non-thumb fingers are straightened to vertical, creating intermediate touch images 2752 and 2754. In a "flick off" embodiment, the pinky may end up not touching the sensor panel, in which case intermediate image 2754 would not be present.

If the images and corresponding features are tracked over time, the completion of various "hand edge gestures" can be accurately detected. At that point, one or more UI elements appearing on the touch screen at the time the gesture was detected can be associated with the "hand edge gesture." After a particular "hand edge gesture" has been detected and the corresponding UI elements have been associated with the gesture, various actions can be taken with regard to the associated UI elements. For example, the detection of a hand edge gesture, without more, can be used to quickly "orient" an application or control along the elongated axis defined by the orientation of the touch image. A rotational sweep of the touch image from vertical to horizontal can be used to "sweep away" the UI elements, or dismiss the normal desktop to bring up alternative desktops, switch users, lock the screen, and the like. A sweep from horizontal to vertical, a straightening of fingers to vertical, or a flicking action can be used to reverse the operation, e.g., resume the normal desktop/workspace. Sweeping the touch image from left edge to right edge can be used to perform an erase operation (e.g., clear a drawing), switch to a workspace or virtual desktop on the right, or perform a continuous redo (or move forward on a timeline). Sweeping the touch image from right edge to left edge can be used to switch to a workspace/virtual desktop on the left or perform a continuous undo (or back up on a timeline). For instance, in a drawing application, these gestures could undo/redo one pivot point at a time on drawn paths, polygons or bezier curves. Sweeping from top to bottom edge or vice versa can similarly clear a drawing, switch workspaces, users, or virtual desktops, or hide an application or UI element, while sweeping from bottom to top can be used to maximize the application or UI element. "Sawing" the gesture up and down can be used to vertically split the window or view.
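
These mappings amount to a dispatch from a detected (orientation, sweep) pair to an action, which can be sketched as a lookup table; the key encoding and the action strings below are paraphrases of the examples in the text, not an exhaustive or authoritative set.

```python
# Dispatch table for the hand edge gesture variants described above.
HAND_EDGE_ACTIONS = {
    ("vertical", None):                   "orient UI along hand-edge axis",
    ("vertical", "rotate_to_horizontal"): "sweep away / alternate desktop",
    ("horizontal", "rotate_to_vertical"): "resume normal desktop",
    ("vertical", "left_to_right"):        "erase, switch workspace right, redo",
    ("vertical", "right_to_left"):        "switch workspace left, undo",
    ("horizontal", "top_to_bottom"):      "clear, switch workspace, or hide",
    ("horizontal", "bottom_to_top"):      "maximize application or UI element",
    ("vertical", "sawing"):               "split window or view vertically",
}

def dispatch_hand_edge(orientation, sweep):
    return HAND_EDGE_ACTIONS.get((orientation, sweep), "no action")

print(dispatch_hand_edge("vertical", "right_to_left"))
```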

FIGS. 28a-28b illustrate an exemplary "framing gesture" according to embodiments of this invention. A "framing gesture" formed by one or both hands over a touch or hover sensitive device can be detected and interpreted to perform operations requiring definition of a certain area.

FIG. 28a illustrates an exemplary orientation of hands 2800 giving a "framing gesture" over area 2802 according to embodiments of this invention. FIG. 28b shows the image of actual touch 2806, which can be roughly the shape of an upside-down U. After the detected image 2806 has been converted to one or more features and classified and grouped as a "framing gesture," parameters for the feature can be computed, such as an approximate window or area 2802 framed by the "framing gesture," and appropriate action can then be taken. Framed window 2802 can be used to open a detail portal, magnifier, or dialog, or to place a photo, for example.

In any of these exemplary identification gestures, the images can be converted to features, which can then be classified, grouped and interpreted as an identification gesture. Each identification gesture can then be associated with a particular user and a UI element for which access is sought, and depending on whether access rights have been satisfied, the user may or may not be granted access to the UI element.

Although the present invention has been fully described in connection with embodiments thereof with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of the present invention as defined by the appended claims.

What is claimed is:
1. A method comprising: at an electronic device with a display and a sensor panel: detecting one or more images at the sensor panel; determining that the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture; and in response to the determination, aligning a user interface element displayed on the display in accordance with an axis defined by the pattern corresponding to the predetermined hand edge gesture.
2. The method of claim 1, wherein determining that the one or more images are arranged in the pattern corresponding to the predetermined hand edge gesture comprises: identifying a pinky feature in the one or more images; and identifying a palm edge feature in the one or more images.
3. The method of claim 1, wherein the axis comprises an elongated axis of the hand edge gesture.
4. The method of claim 1, wherein aligning the user interface element displayed on the display in accordance with the axis defined by the one or more images comprises aligning the user interface element with the axis.
5. The method of claim 1, wherein the one or more images comprise one or more touch images.
6. A method comprising: at an electronic device with a display and a sensor panel: detecting one or more images at the sensor panel; determining that: the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture having a first orientation along a first axis on the sensor panel, and the one or more images move along a second axis, different from the first axis, while maintaining the first orientation; and in response to the determination, performing an action in accordance with the movement of the one or more images along the second axis.
7. The method of claim 6, wherein determining that the one or more images are arranged in the pattern corresponding to the predetermined hand edge gesture comprises: identifying a pinky feature in the one or more images; and identifying a palm edge feature in the one or more images.
8. The method of claim 6, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the left portion to the right portion on the sensor panel, and the action comprises an erase operation.
9. The method of claim 6, further comprising: displaying a first workspace on the display, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the left portion to the right portion on the sensor panel, and the action comprises switching from displaying the first workspace on the display to displaying a second workspace on the display, different from the first workspace.
10. The method of claim 6, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the left portion to the right portion on the sensor panel, and the action comprises performing a redo operation in accordance with the movement of the one or more images.
11. The method of claim 6, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the left portion to the right portion on the sensor panel, and the action comprises moving forward on a timeline in accordance with the movement of the one or more images.
12. The method of claim 6, further comprising: displaying a first workspace on the display, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the right portion to the left portion on the sensor panel, and the action comprises switching from displaying the first workspace on the display to displaying a second workspace on the display, different from the first workspace.
13. The method of claim 6, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the right portion to the left portion on the sensor panel, and the action comprises performing an undo operation in accordance with the movement of the one or more images.
14. The method of claim 6, wherein: the sensor panel has: a left portion corresponding to a left portion of the display, and a right portion corresponding to a right portion of the display, the movement of the one or more images along the second axis comprises a movement from the right portion to the left portion on the sensor panel, and the action comprises moving backward on a timeline in accordance with the movement of the one or more images.
15. The method of claim 6, further comprising: displaying a first workspace on the display, wherein: the sensor panel has: a top portion corresponding to a top portion of the display, and a bottom portion corresponding to a bottom portion of the display, the movement of the one or more images along the second axis comprises a movement from the top portion to the bottom portion on the sensor panel, and the action comprises switching from displaying the first workspace on the display to displaying a second workspace on the display, different from the first workspace.
16. The method of claim 6, wherein: the sensor panel has: a top portion corresponding to a top portion of the display, and a bottom portion corresponding to a bottom portion of the display, the movement of the one or more images along the second axis comprises a movement from the top portion to the bottom portion on the sensor panel, and the action comprises hiding a user interface element on the display.
17. The method of claim 6, wherein: the sensor panel has: a top portion corresponding to a top portion of the display, and a bottom portion corresponding to a bottom portion of the display, the movement of the one or more images along the second axis comprises a movement from the bottom portion to the top portion on the sensor panel, and the action comprises maximizing a user interface element on the display.
18. The method of claim 6, wherein the one or more images comprise one or more touch images.
19. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device with a display and a sensor panel, cause the electronic device to: detect one or more images at the sensor panel; determine that the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture; and in response to the determination, align a user interface element displayed on the display in accordance with an axis defined by the pattern corresponding to the predetermined hand edge gesture.
20. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device with a display and a sensor panel, cause the electronic device to: detect one or more images at the sensor panel; determine that: the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture having a first orientation along a first axis on the sensor panel, and the one or more images move along a second axis, different from the first axis, while maintaining the first orientation; and in response to the determination, perform an action in accordance with the movement of the one or more images along the second axis.
21. An electronic device, comprising: one or more processors; memory; a display; a sensor panel; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: detecting one or more images at the sensor panel; determining that the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture; and in response to the determination, aligning a user interface element displayed on the display in accordance with an axis defined by the pattern corresponding to the predetermined hand edge gesture.
22. An electronic device, comprising: one or more processors; memory; a display; a sensor panel; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: detecting one or more images at the sensor panel; determining that: the one or more images are arranged in a pattern corresponding to a predetermined hand edge gesture having a first orientation along a first axis on the sensor panel, and the one or more images move along a second axis, different from the first axis, while maintaining the first orientation; and in response to the determination, performing an action in accordance with the movement of the one or more images along the second axis.